AIF-C01 — AWS Certified AI Practitioner Exam Blueprint

Practical exam blueprint for AWS Certified AI Practitioner (AIF-C01) candidates reviewing AI, generative AI, AWS services, responsible AI, security, and governance.

How to Use This Exam Blueprint

Use this checklist as a practical study map for the AWS Certified AI Practitioner (AIF-C01) exam. It is organized around readiness areas rather than exact exam weights. For each area, confirm that you can recognize concepts, choose appropriate AWS services, interpret scenario cues, and avoid common distractors.

A strong candidate should be able to:

  • Explain core AI, machine learning, and generative AI concepts in plain language.
  • Match AWS AI/ML services to business use cases.
  • Identify when to use managed AI services, foundation models, retrieval-augmented generation, or custom model training.
  • Apply responsible AI, security, privacy, governance, and cost-aware thinking.
  • Interpret scenario wording and select the safest, simplest, most appropriate AWS-native approach.

Use this page for final review after reading the AWS exam guide and doing hands-on practice. It is independent study support and is not affiliated with AWS.

Topic-Area Readiness Map

Readiness areaWhat to reviewYou are ready when you can…Common exam-style cue
AI and ML fundamentalsAI vs ML vs deep learning, model types, training, inference, features, labels, datasetsExplain basic AI/ML terms and distinguish model-building from model-consuming scenarios“A company wants predictions from historical labeled data…”
ML task typesClassification, regression, clustering, anomaly detection, recommendation, forecasting, NLP, computer visionChoose the correct ML task for a business problem“Predict a category,” “estimate a number,” “group similar items”
Model lifecycleData collection, preparation, training, validation, deployment, monitoring, retrainingIdentify where a process fits in the lifecycle and why monitoring matters“Model performance has degraded after deployment…”
Generative AI foundationsFoundation models, large language models, prompts, tokens, embeddings, inference parametersExplain how generative AI applications produce and control outputs“Generate summaries,” “answer questions from documents,” “create content”
Prompt engineeringInstructions, context, examples, output format, constraints, prompt iterationImprove prompt quality without retraining a model“Responses are inconsistent or too verbose…”
RAG and knowledge groundingEmbeddings, vector stores, retrieval, chunking, citations, context windowsExplain how retrieval-augmented generation reduces unsupported answers“The model must answer using internal documents…”
AWS AI servicesAmazon Bedrock, Amazon SageMaker, Amazon Q, and managed AI services such as Amazon Comprehend, Rekognition, Textract, Transcribe, Translate, Polly, Lex, Kendra, PersonalizeMatch services to common AI use cases without overengineering“Extract text from scanned forms,” “translate support tickets,” “build a chatbot”
Amazon Bedrock readinessFoundation model access, model selection, prompts, Knowledge Bases, Agents, Guardrails, evaluation conceptsRecognize Bedrock as a managed path for building generative AI applications“Use foundation models without managing infrastructure…”
Amazon SageMaker readinessBuild, train, tune, deploy, and monitor ML models; notebooks; pipelines; labeling; endpointsRecognize when custom ML development is needed“Data scientists need to train and deploy a custom model…”
Responsible AIFairness, bias, explainability, transparency, toxicity, privacy, safety, human oversightIdentify risks and select mitigations appropriate to the scenario“Avoid harmful outputs,” “detect bias,” “require human review”
Security and governanceIAM, least privilege, encryption, logging, monitoring, data protection, compliance alignmentApply basic AWS security controls to AI/ML workloads“Sensitive data must not be exposed to unauthorized users…”
Cost and operational awarenessManaged services, right-sizing, serverless patterns, model choice, monitoring, lifecycle managementChoose approaches that balance performance, cost, simplicity, and operations“Minimize operational overhead,” “control inference cost”

Core AI and Machine Learning Fundamentals

Concepts to Know

ConceptPractical meaningReadiness check
Artificial intelligenceSystems that perform tasks associated with human intelligenceCan you explain AI without equating it only with generative AI?
Machine learningModels learn patterns from data instead of being explicitly programmed for every ruleCan you identify when historical data is needed?
Deep learningML using neural networks with many layersCan you recognize deep learning use cases such as vision, speech, and language?
Supervised learningLearns from labeled examplesCan you spot labels such as “fraud/not fraud” or “price”?
Unsupervised learningFinds patterns without labeled outputsCan you identify clustering and segmentation scenarios?
Reinforcement learningLearns actions based on rewards and penaltiesCan you distinguish it from supervised prediction?
TrainingProcess of learning patterns from dataCan you identify data quality as a training dependency?
InferenceUsing a trained model to generate predictions or outputsCan you separate model training from real-time prediction?
FeatureInput variable used by a modelCan you tell whether a field is an input or target?
LabelKnown answer used for supervised trainingCan you identify the target column in a scenario?

Can You Do This?

  • Explain the difference between AI, ML, deep learning, and generative AI.
  • Identify whether a scenario requires classification, regression, clustering, forecasting, recommendation, NLP, or computer vision.
  • Distinguish model training from model inference.
  • Recognize why training data quality affects model quality.
  • Explain overfitting as performing well on training data but poorly on new data.
  • Explain underfitting as failing to capture meaningful patterns.
  • Identify data leakage, where information from the answer or future leaks into training.
  • Recognize model drift, where real-world data changes after deployment.
  • Explain why models need monitoring after deployment.

ML Task Selection Checklist

Business needLikely task typeExample outputWatch for
Predict whether an email is spamClassificationSpam or not spamCategory output
Predict next month’s sales amountRegression or forecastingNumeric valueTime-based patterns may imply forecasting
Group customers by behaviorClusteringSegment A, B, CNo predefined labels
Detect unusual transactionsAnomaly detectionNormal or anomalousRare or suspicious patterns
Recommend productsRecommendationRanked item listPersonalization based on behavior
Extract sentiment from reviewsNatural language processingPositive, neutral, negativeText understanding
Identify objects in imagesComputer visionObject labels or bounding boxesImage/video input
Convert speech to textSpeech recognitionTranscriptAudio input
Translate support messagesTranslationText in target languageLanguage conversion
Extract text from documentsOCR/document AIStructured text fieldsForms, PDFs, scanned files

Generative AI Foundations

Concepts to Review

ConceptWhat it meansExam-readiness signal
Foundation modelLarge model trained on broad data and adaptable to many tasksYou know why teams use one model for many language or content tasks
Large language modelFoundation model focused on language tasksYou can identify chat, summarization, Q&A, and generation scenarios
PromptUser or system instruction sent to a modelYou can improve output by changing instructions and context
TokenUnit of text processed by a modelYou know tokens affect context, latency, and cost considerations
EmbeddingNumeric representation of meaningYou know embeddings support semantic search and RAG
Context windowAmount of input/output the model can considerYou know long documents may need chunking or retrieval
TemperatureControls output randomnessYou know lower values are more deterministic and higher values more creative
HallucinationPlausible but unsupported or incorrect outputYou know grounding, retrieval, guardrails, and human review can reduce risk
Fine-tuningFurther training a model for a specialized task or styleYou know it is different from prompt engineering and RAG
RAGRetrieval-augmented generation using external knowledge at response timeYou know it is useful when answers must reflect enterprise documents

Prompt Engineering Readiness

You should be able to build and diagnose prompts that include:

  • Clear role or task instruction.
  • Relevant context.
  • Specific constraints.
  • Desired output format.
  • Examples, when useful.
  • Instructions for handling uncertainty.
  • Boundaries for what the model should not do.
  • A way to reduce ambiguity in the expected answer.

Example prompt structure:

Task: Summarize the customer case notes.
Context: Use only the notes provided below.
Constraints: Do not include personal data in the summary.
Output format: Three bullet points and one recommended next action.
If unsure: Say what information is missing.

Generative AI Scenario Cues

Scenario wordingLikely concept being testedGood response pattern
“Responses must be based on company policy documents”RAG / groundingRetrieve relevant documents and include them as context
“The chatbot produces unsafe or off-brand responses”Guardrails / safety controlsAdd content filters, policy constraints, review workflows
“The model needs current internal information”Knowledge groundingUse retrieval rather than relying only on pretraining
“The output format is inconsistent”Prompt designSpecify schema, format, and examples
“The company wants less creative, repeatable responses”Inference parametersReduce randomness and tighten instructions
“The model must follow a specialized tone or task pattern”Prompting or fine-tuningStart with prompt examples; consider fine-tuning if prompting is insufficient
“The model must take actions across systems”Agents / tool useUse an agent pattern with controlled permissions and validation

Amazon Bedrock Readiness

For AIF-C01, be ready to recognize Amazon Bedrock as an AWS-managed way to build and scale generative AI applications using foundation models.

Bedrock Topics to Check

TopicWhat to knowReady when you can…
Foundation model selectionDifferent models may suit different tasks, latency, cost, and output needsChoose a model based on use case rather than assuming one model fits all
Prompt-based inferenceApplications send prompts and receive generated responsesExplain inference without describing full custom model training
Knowledge groundingEnterprise documents can be used to improve answer relevanceRecognize RAG use cases
EmbeddingsText can be transformed into vectors for semantic searchExplain why embeddings help find meaning, not just keywords
Knowledge BasesManaged approach for connecting data sources to retrieval workflowsIdentify when a knowledge base supports internal-document Q&A
AgentsOrchestrate tasks and tool callsRecognize scenarios requiring action, not just text generation
GuardrailsHelp control harmful, unsafe, or undesired contentSelect guardrails when safety and policy enforcement are emphasized
Model evaluationCompare outputs for quality, safety, and task fitKnow that model choice should be validated against business requirements
Security controlsIAM, encryption, logging, network controls, and data governance still matterAvoid treating generative AI as exempt from standard security practices

Bedrock Decision Prompts

Ask yourself:

  • Is the scenario asking for a generative AI application without managing ML infrastructure?
  • Does the application need a foundation model rather than a traditional classifier or regressor?
  • Does the model need access to private documents or current enterprise knowledge?
  • Are hallucination, unsafe content, or sensitive output risks mentioned?
  • Is there a need for a chatbot, summarizer, document assistant, code assistant, or agentic workflow?
  • Would prompt engineering, RAG, or guardrails solve the problem before custom training?

Amazon SageMaker Readiness

Recognize Amazon SageMaker when the scenario involves building, training, tuning, deploying, or managing custom ML models.

SageMaker-related needWhy it points to SageMakerReadiness check
Data scientists need notebooksDevelopment and experimentationCan you distinguish notebooks from deployed inference endpoints?
A team must train a custom modelFull ML lifecycleCan you identify training data and target variable needs?
Hyperparameters need tuningModel optimizationCan you explain tuning as searching for better training settings?
A model must be deployed for predictionsInference endpoint or deployment workflowCan you identify latency and scaling considerations?
Training workflows need automationPipelines and MLOpsCan you recognize repeatable model lifecycle needs?
Data needs labelingGround truth / labeling workflowsCan you identify supervised learning preparation?
Model quality must be monitoredModel monitoringCan you recognize drift and performance degradation cues?

Bedrock vs SageMaker vs Managed AI Services

If the scenario says…Think first about…Why
“Use foundation models to build a generative AI app”Amazon BedrockManaged foundation model access and generative AI application patterns
“Train a custom ML model from company data”Amazon SageMakerCustom model development and ML lifecycle
“Extract text from scanned documents”Amazon TextractPurpose-built document text and data extraction
“Analyze sentiment in text”Amazon ComprehendManaged NLP
“Detect labels or faces in images”Amazon RekognitionManaged computer vision
“Convert speech to text”Amazon TranscribeManaged speech recognition
“Convert text to speech”Amazon PollyManaged speech synthesis
“Translate text between languages”Amazon TranslateManaged translation
“Build a conversational bot”Amazon Lex or generative AI chatbot patternIntent-based bot or LLM-based assistant depending on requirements
“Search enterprise knowledge”Amazon Kendra or RAG architectureIntelligent search and retrieval use cases
“Personalize recommendations”Amazon PersonalizeManaged recommendation use cases

AWS AI Service Selection Checklist

Purpose-Built AI Services

ServiceTypical use caseCommon distractor
Amazon ComprehendSentiment, entities, key phrases, language insights from textDo not choose a custom model when managed NLP is enough
Amazon RekognitionImage and video analysisDo not confuse with document text extraction
Amazon TextractExtract text, forms, and tables from documentsDo not choose generic OCR wording if structured document extraction is needed
Amazon TranscribeSpeech-to-text transcriptionDo not confuse with text-to-speech
Amazon PollyText-to-speech audio generationDo not confuse with speech-to-text
Amazon TranslateLanguage translationDo not confuse with summarization or sentiment
Amazon LexConversational interfaces using intents and slotsCompare with generative chatbots when open-ended responses are needed
Amazon KendraEnterprise searchCompare with RAG when the answer generation layer is emphasized
Amazon PersonalizeRecommendations and personalizationDo not confuse with general forecasting
Amazon ForecastTime-series forecastingDo not use for generic classification tasks

Service Selection Questions

  • Is there a managed AI service that directly solves the problem?
  • Is the task generative, predictive, extractive, conversational, or analytical?
  • Does the company need to train a model, or only consume AI capability?
  • Is operational overhead a key requirement?
  • Are security, privacy, and data residency concerns part of the scenario?
  • Is low-code/no-code or minimal ML expertise mentioned?
  • Does the wording emphasize “custom,” “data scientists,” or “training pipeline”?

Data Readiness and Data Quality

AI systems are only as reliable as the data and context they use. Be prepared for scenario questions where the best answer is a data-quality, governance, or lifecycle action rather than a model change.

Data issueWhy it mattersLikely mitigation
Missing valuesCan reduce model accuracy or cause biased patternsClean, impute, or remove affected records
DuplicatesCan distort training patternsDeduplicate data
Class imbalanceModel may favor the majority classResampling, class weighting, better metrics
Biased dataOutputs may be unfair or discriminatoryBias analysis, representative data, governance review
Outdated dataModel may not reflect current conditionsRefresh data, retrain, monitor drift
Data leakageModel learns information unavailable at prediction timeFix feature selection and validation design
Sensitive data exposurePrivacy or compliance riskMask, redact, encrypt, restrict access
Poor labelsSupervised model learns incorrect patternsImprove labeling quality and review process
Irrelevant featuresNoise may reduce model qualityFeature selection and domain review

Data Pipeline and Storage Checks

  • Know that Amazon S3 is commonly used as durable object storage for data lakes, training data, and AI artifacts.
  • Recognize data preparation steps: ingestion, cleaning, transformation, labeling, splitting, and validation.
  • Understand why training, validation, and test datasets should be separated.
  • Know that metadata, lineage, and governance help teams understand where data came from and how it is used.
  • Recognize when streaming data, batch data, or document repositories change the architecture pattern.
  • Understand that private or regulated data requires access control, encryption, monitoring, and retention planning.

Model Evaluation and Metrics

You do not need to become a data scientist for this exam, but you should understand what common metrics mean and when they matter.

Classification Metrics

MetricPlain meaningUse when…
AccuracyOverall percentage correctClasses are reasonably balanced and all errors have similar cost
PrecisionOf predicted positives, how many were actually positiveFalse positives are costly
RecallOf actual positives, how many were foundFalse negatives are costly
F1 scoreBalance between precision and recallYou need a combined view, especially with imbalance
Confusion matrixCounts true positives, false positives, true negatives, false negativesYou need to understand error types
AUC/ROC conceptAbility to separate classes across thresholdsYou need threshold-independent comparison

Key formulas:

\[ \text{Accuracy} = \frac{\text{Correct Predictions}}{\text{Total Predictions}} \]\[ \text{Precision} = \frac{\text{True Positives}}{\text{True Positives} + \text{False Positives}} \]\[ \text{Recall} = \frac{\text{True Positives}}{\text{True Positives} + \text{False Negatives}} \]\[ \text{F1} = 2 \times \frac{\text{Precision} \times \text{Recall}}{\text{Precision} + \text{Recall}} \]

Metric Scenario Cues

Scenario cueMetric or concern
“Do not flag legitimate customers as fraudsters”Precision / false positives
“Do not miss actual fraud”Recall / false negatives
“Positive cases are rare”Class imbalance, precision, recall, F1
“Model performs well in testing but poorly in production”Drift, overfitting, data mismatch
“Model gives different results for demographic groups”Bias and fairness evaluation
“Generated answers sound plausible but are wrong”Hallucination, grounding, evaluation quality

Responsible AI and Safety Checklist

Responsible AI is not a separate afterthought. It should influence data, model design, deployment, monitoring, and user experience.

Responsible AI concernWhat it meansReadiness action
FairnessAvoid unfair outcomes for groups or individualsCheck data representation and outcome differences
BiasSystematic skew in data, model, or processIdentify sources of bias and mitigation options
ExplainabilityAbility to understand why a model produced an outputChoose approaches that support transparency where needed
TransparencyUsers know when AI is involved and what it can/cannot doProvide disclosures and limitations
PrivacyProtect personal or sensitive informationMinimize, mask, encrypt, and restrict data
SafetyAvoid harmful, toxic, or dangerous outputsUse guardrails, filters, and review paths
AccountabilityHumans and teams remain responsible for outcomesDefine ownership, review, and escalation
Human oversightHumans review high-risk decisionsUse human-in-the-loop for sensitive cases
RobustnessModel behaves acceptably under varied inputsTest edge cases and monitor failures

Responsible AI “Can You Do This?” Checklist

  • Identify bias risk in training data, prompts, retrieval sources, and model outputs.
  • Explain why representative data matters.
  • Recognize when human review is required for high-impact decisions.
  • Identify hallucination risk in generative AI responses.
  • Select grounding, guardrails, content moderation, and citations as mitigation patterns.
  • Explain why explainability matters for regulated, financial, healthcare, or employment-related decisions.
  • Recognize that security controls and responsible AI controls are complementary.
  • Identify why monitoring should include quality, fairness, safety, and user feedback.

Security, Privacy, and Governance Readiness

For AWS AI workloads, apply standard cloud security principles: least privilege, encryption, network control, logging, monitoring, and governance.

AreaWhat to knowScenario cue
IAMGrant only required permissions to users, roles, and services“Limit access to models and data”
EncryptionProtect data at rest and in transit“Sensitive data must be protected”
AWS KMSManage encryption keys for supported AWS services“Customer-managed keys” or “key control”
LoggingCapture activity for auditing and investigation“Track who accessed what”
Amazon CloudWatchMetrics, logs, alarms, operational visibility“Monitor performance or errors”
AWS CloudTrailAPI activity and account-level auditing“Audit administrative actions”
VPC/network controlsPrivate connectivity and network isolation patterns“Do not expose traffic publicly”
Data minimizationUse only necessary data“Reduce sensitive data exposure”
Redaction/maskingRemove or obscure sensitive fields“Prevent PII from appearing in prompts or outputs”
GovernancePolicies, approvals, tagging, cost controls, lifecycle rules“Ensure consistent organizational controls”

AI-Specific Security Checks

  • Do not put sensitive data into prompts unless the architecture and controls allow it.
  • Restrict who can access AI services, models, datasets, embeddings, logs, and generated outputs.
  • Protect vector stores and knowledge bases because they may contain sensitive semantic representations.
  • Log and monitor AI application activity, but avoid creating logs that expose secrets or regulated data.
  • Use least privilege for agents or tool-calling workflows.
  • Validate actions before an AI agent performs changes in external systems.
  • Consider prompt injection risk when users or retrieved documents can influence model behavior.
  • Apply content filtering, input validation, and output validation where appropriate.
  • Include incident response and rollback planning for AI-enabled applications.

RAG, Embeddings, and Knowledge Base Readiness

Retrieval-augmented generation is a frequent practical pattern for enterprise generative AI.

RAG Workflow

    flowchart LR
	    A[User question] --> B[Create query embedding]
	    B --> C[Search vector store or knowledge base]
	    C --> D[Retrieve relevant chunks]
	    D --> E[Send prompt plus retrieved context to model]
	    E --> F[Generate grounded response]
	    F --> G[Apply guardrails and validation]
	    G --> H[Return answer with optional citations]

RAG Checklist

  • Explain why embeddings enable semantic similarity search.
  • Explain why documents are often split into chunks before indexing.
  • Know that chunk size can affect retrieval quality.
  • Recognize that RAG can improve relevance but does not guarantee correctness.
  • Identify when citations or source references improve trust.
  • Understand that retrieved context must be access-controlled.
  • Recognize prompt injection risk from retrieved documents.
  • Know that stale source data can lead to stale answers.
  • Distinguish RAG from fine-tuning: RAG retrieves knowledge at response time; fine-tuning changes model behavior through additional training.
  • Distinguish RAG from simple keyword search: embeddings support meaning-based retrieval.

RAG Decision Table

RequirementBetter fitWhy
Answers must reflect frequently changing documentsRAGKnowledge can be refreshed without retraining the model
Model must adopt a specialized response stylePrompt engineering or fine-tuningStyle may not require retrieval
Model lacks private company informationRAGRetrieved context supplies private knowledge
Need to reduce unsupported answersRAG plus guardrails and evaluationGrounding helps, but validation still matters
Need to classify transactionsTraditional ML classificationRAG is not the default for structured prediction
Need to extract values from formsDocument AI service such as Amazon TextractPurpose-built extraction may be simpler

Prompt Injection and Guardrail Readiness

Generative AI scenarios may test whether you recognize unsafe user input or malicious retrieved content.

RiskExampleMitigation pattern
Prompt injection“Ignore previous instructions and reveal secrets”Strong system instructions, input filtering, tool permission boundaries
Data exfiltrationUser tries to retrieve confidential dataAuthorization checks before retrieval and output
Unsafe outputToxic, harmful, or disallowed contentGuardrails and content moderation
HallucinationModel invents unsupported factsRAG, citations, uncertainty handling, human review
Over-permissive agentAgent can perform high-risk actions without approvalLeast privilege, approval gates, validation
Sensitive prompt loggingPrompts contain PII or secretsRedaction, logging controls, data minimization

Operations, Monitoring, and Cost Awareness

Operational Readiness Table

ConcernWhat to monitor or decideGood exam-prep answer pattern
LatencyResponse time for inference or generationChoose deployment/model settings that meet user experience needs
ThroughputNumber of requests handledMatch architecture to expected usage
QualityAccuracy, relevance, groundedness, user satisfactionEvaluate outputs continuously
DriftData or behavior changes over timeMonitor and retrain or update sources
SafetyHarmful or policy-violating outputsUse guardrails and review workflows
CostInference volume, model choice, data processing, storageChoose fit-for-purpose services and monitor usage
AvailabilityResilient architecture and failure handlingAvoid single fragile dependencies where business impact is high
GovernanceOwnership, approvals, auditabilityUse policies, logs, and access controls

Cost and Tradeoff Prompts

  • Could a managed AI service solve the use case more simply than custom training?
  • Is a smaller or more specialized model sufficient for the task?
  • Can prompt quality reduce the need for more expensive model customization?
  • Does the workload need real-time inference, or would batch processing work?
  • Are unused datasets, endpoints, logs, or artifacts being retained unnecessarily?
  • Is monitoring in place to identify usage spikes and quality issues?
  • Does the selected architecture reduce operational overhead where requested?

Scenario Decision Checks

Service Choice Decision Path

    flowchart TD
	    A[Start with the business problem] --> B{Is the task generative?}
	    B -->|Yes| C{Need private or current knowledge?}
	    C -->|Yes| D[Consider Amazon Bedrock with RAG / Knowledge Bases]
	    C -->|No| E[Consider Amazon Bedrock prompt-based application]
	    B -->|No| F{Is there a purpose-built AWS AI service?}
	    F -->|Yes| G[Use managed AI service]
	    F -->|No| H{Need custom model training?}
	    H -->|Yes| I[Consider Amazon SageMaker]
	    H -->|No| J[Re-check requirements and simplest managed option]

Judgment Questions to Practice

Ask yourselfIf yes, lean toward…
Is the company asking for summarization, generation, brainstorming, or chat?Generative AI / Amazon Bedrock
Does the answer need company documents?RAG / knowledge base pattern
Does the scenario emphasize custom training by data scientists?Amazon SageMaker
Is there a direct AWS managed AI service for the task?Purpose-built AI service
Is the main risk unsafe or noncompliant output?Guardrails, moderation, governance
Is the main risk unauthorized access?IAM, encryption, logging, network controls
Is the main risk incorrect but confident answers?Grounding, citations, evaluation, human review
Is the main issue production degradation?Monitoring, drift detection, retraining/update process
Is operational simplicity a priority?Managed services over custom infrastructure

Common Weak Areas and Traps

Weak areaWhy candidates miss itHow to avoid the trap
Treating all AI as generative AINot every AI scenario needs an LLMFirst identify the task type
Choosing SageMaker for everythingCustom ML is powerful but may be unnecessaryLook for managed AI services and Bedrock cues
Ignoring data qualityModels cannot compensate for bad data indefinitelyCheck missing data, bias, labels, leakage, and drift
Confusing precision and recallBoth involve positive predictions but answer different questionsMap the scenario to false positives or false negatives
Confusing Transcribe and PollyAudio direction mattersTranscribe is speech to text; Polly is text to speech
Confusing Textract and RekognitionDocuments and images are different use casesTextract extracts document text and structures; Rekognition analyzes images/video
Assuming RAG eliminates hallucinationsRAG improves grounding but does not guarantee correctnessAdd evaluation, citations, guardrails, and review
Forgetting IAM for AI servicesAI workloads still need least privilegeApply normal AWS security principles
Overlooking sensitive data in promptsPrompts can contain regulated or confidential dataMinimize, redact, encrypt, and control access
Letting agents act too freelyTool use can create real-world impactUse least privilege, approval, validation, and logging
Focusing only on accuracyAccuracy can mislead with imbalanced dataConsider precision, recall, F1, and business cost of errors
Ignoring monitoring after deploymentModel behavior can change over timeMonitor quality, drift, latency, cost, and safety

Final-Week Review Checklist

Concept Review

  • I can explain AI, ML, deep learning, and generative AI differences.
  • I can identify supervised, unsupervised, and reinforcement learning scenarios.
  • I can match classification, regression, clustering, anomaly detection, recommendation, forecasting, NLP, and computer vision to business problems.
  • I can explain training, validation, testing, deployment, inference, monitoring, and retraining.
  • I can describe overfitting, underfitting, drift, leakage, and class imbalance.

AWS Service Review

  • I can identify when Amazon Bedrock is the best fit.
  • I can identify when Amazon SageMaker is the best fit.
  • I can match Amazon Comprehend, Rekognition, Textract, Transcribe, Translate, Polly, Lex, Kendra, Personalize, and Forecast to use cases.
  • I can explain the difference between managed AI services and custom ML model development.
  • I can choose simpler managed services when the scenario asks for low operational overhead.

Generative AI Review

  • I can explain prompts, tokens, embeddings, foundation models, context windows, and inference parameters.
  • I can improve a prompt using context, examples, constraints, and output format.
  • I can explain RAG and when it is useful.
  • I can distinguish RAG from fine-tuning.
  • I can identify hallucination risk and mitigation options.
  • I can recognize when guardrails, citations, or human review are needed.

Security and Responsible AI Review

  • I can apply IAM least privilege to AI services and data access.
  • I can identify when encryption, logging, monitoring, and audit trails are needed.
  • I can recognize risks from sensitive data in prompts, logs, datasets, and outputs.
  • I can identify fairness, bias, explainability, transparency, privacy, and safety concerns.
  • I can select responsible AI controls appropriate to the risk level.

Scenario Practice Review

  • For each question, I identify the task before choosing a service.
  • I look for words such as “custom training,” “foundation model,” “documents,” “sensitive data,” “low operational overhead,” and “human review.”
  • I eliminate answers that overengineer the solution.
  • I eliminate answers that ignore security or governance requirements.
  • I check whether a purpose-built AWS AI service solves the problem directly.
  • I can justify why the correct answer is better than the closest distractor.

Practical Next Step

Pick one weak area from each category: AI/ML fundamentals, generative AI, AWS service selection, security/governance, and responsible AI. Review those notes, then answer mixed scenario questions without looking at explanations. After each miss, write down the clue you overlooked and the decision rule you should apply next time.

Browse Certification Practice Tests by Exam Family