AIF-C01 — AWS Certified AI Practitioner Quick Reference

Compact AWS Certified AI Practitioner (AIF-C01) reference for AI/ML concepts, AWS services, generative AI, security, governance, and exam decision points.

Exam focus at a glance

This independent Quick Reference supports preparation for the AWS Certified AI Practitioner (AIF-C01) exam from AWS. Use it to quickly connect AI/ML concepts, AWS service choices, responsible AI, and security/governance decisions.

What to recognize quicklyExam-prep cue
AI vs ML vs deep learning vs generative AIAI is the broad field; ML learns from data; deep learning uses neural networks; generative AI creates new content using foundation models.
Managed AI service vs custom ML modelUse managed AI services for common tasks; use Amazon SageMaker when you need custom training, tuning, deployment, or MLOps.
Amazon Bedrock vs Amazon SageMakerBedrock is for building with foundation models and generative AI APIs; SageMaker is for building, training, tuning, deploying, and monitoring ML models.
Prompt engineering vs RAG vs fine-tuningPrompting changes instructions; RAG adds retrieved knowledge; fine-tuning changes model behavior for a task or style.
Responsible AI risksBias, hallucination, toxicity, data leakage, explainability gaps, and misuse.
Security responsibilitiesAWS secures the cloud infrastructure; customers secure identities, data, prompts, access, model use, monitoring, and compliance configuration.
Evaluation metricsMatch metric to problem type: precision/recall/F1 for classification, RMSE/MAE for regression, grounding/toxicity/human review for generative AI.

Core AI, ML, and generative AI vocabulary

TermCompact meaningHigh-yield distinction
Artificial intelligenceSystems that perform tasks associated with human intelligenceBroad umbrella: includes ML, rules, optimization, NLP, vision, robotics
Machine learningModels learn patterns from dataNot explicitly programmed rules for every case
Deep learningML using multi-layer neural networksCommon for language, images, speech, complex patterns
Foundation modelLarge model trained on broad data and adaptable to many tasksBase for generative AI, chat, summarization, embeddings, code, image tasks
Large language modelFoundation model focused on languageGenerates, summarizes, classifies, extracts, reasons over text
Multimodal modelWorks with more than one modalityText plus images, audio, video, or documents
TokenUnit of text processed by a modelCost, latency, and context capacity are affected by token usage
EmbeddingNumeric vector representing semantic meaningUsed for similarity search, clustering, recommendations, and RAG
PromptInput instructions and context sent to a modelBetter prompts can improve output without retraining
InferenceUsing a trained model to make predictions or generate outputDifferent from training or fine-tuning
RAGRetrieval-Augmented GenerationRetrieves trusted context before generation; useful for private/current facts
Fine-tuningAdditional training on labeled or curated examplesBest for task behavior, style, or format; not ideal for constantly changing facts
HallucinationPlausible but incorrect model outputMitigate with grounding, retrieval, validation, guardrails, and human review
BiasSystematic unfairness or skew in data/model outputCan come from data, labels, sampling, features, or feedback loops
FeatureInput variable used by a modelFeature quality strongly affects ML performance
LabelKnown target value for supervised learningExample: fraud/not fraud, price, category
Ground truthTrusted correct answer or labelNeeded for training and evaluation
Training setData used to fit model parametersShould not include validation/test leakage
Validation setData used to tune model choicesHelps select features, hyperparameters, thresholds
Test setHeld-out data for final evaluationShould represent real future data
OverfittingPerforms well on training data but poorly on new dataOften too complex, too little data, or leakage
UnderfittingModel too simple to capture patternsPoor training and validation performance
DriftData or relationship changes over timeRequires monitoring and possible retraining
MLOpsOperational practices for ML lifecycleIncludes versioning, deployment, monitoring, rollback, governance

AWS AI service selection matrix

NeedPrimary AWS service to considerChoose whenCommon trap
Build generative AI app with foundation modelsAmazon BedrockYou need managed access to FMs, chat, summarization, RAG, agents, embeddings, or guardrailsConfusing Bedrock with full custom ML training infrastructure
Build, train, tune, deploy, and monitor custom MLAmazon SageMakerYou need notebooks, training jobs, model registry, endpoints, batch inference, monitoring, or MLOpsChoosing Bedrock when the scenario requires custom training pipelines
No-code/low-code ML for business usersAmazon SageMaker CanvasUsers need predictions without writing codeAssuming every ML scenario requires writing training code
Enterprise assistant over business dataAmazon Q BusinessYou need a managed workplace assistant connected to enterprise contentUsing Bedrock directly when the requirement is a ready business assistant
Developer coding assistantAmazon Q DeveloperYou need code suggestions, explanations, or development assistanceConfusing developer productivity assistant with a general ML platform
Text sentiment, entities, key phrases, language detectionAmazon ComprehendYou need managed NLP extraction/classificationUsing a foundation model for simple standard NLP when a specialized service fits
Speech to textAmazon TranscribeConvert audio into textConfusing with Amazon Polly, which creates speech from text
Text to speechAmazon PollyGenerate natural-sounding speech from textConfusing with Transcribe
Translate textAmazon TranslateMachine translation across languagesUsing custom LLM prompts when a managed translation service is enough
Chatbot/contact center conversational interfaceAmazon LexBuild voice/text conversational botsConfusing with Bedrock chat; Lex manages intents, slots, and bot flow
Enterprise searchAmazon KendraSearch across enterprise documents with relevance and connectorsConfusing with generic vector search or log/search analytics
Search, analytics, vector similarityAmazon OpenSearch ServiceNeed search indexes, log analytics, vector search, similarity retrievalChoosing Kendra when you need lower-level search/vector control
Extract text, tables, forms from documentsAmazon TextractOCR plus structured document extractionConfusing with Rekognition image/video labels
Image/video labels, faces, moderationAmazon RekognitionComputer vision for images/videoUsing Textract for object/face detection
Personalized recommendationsAmazon PersonalizeRecommendations without building the full ML stackAssuming recommendation systems always require SageMaker
Human data labelingAmazon SageMaker Ground TruthNeed labeled datasets with workflows and human reviewTreating unlabeled raw data as ready for supervised learning
Data catalog and ETLAWS GluePrepare/catalog data for analytics or MLConfusing data preparation with model training
Object storage for datasets/artifactsAmazon S3Store raw data, training data, model artifacts, logsNot automatically a catalog, access governance, or model registry
Data lake permissionsAWS Lake FormationGovern data lake access and permissionsAssuming S3 bucket policies alone cover all lake governance needs
Data discovery/governance collaborationAmazon DataZoneShare, catalog, discover, and govern data across teamsConfusing with model monitoring or AI guardrails

Fast decision path: managed AI, generative AI, or custom ML

    flowchart TD
	    A[AI requirement] --> B{Common prebuilt task?}
	    B -->|Speech, OCR, vision, translation, NLP| C[Use specialized AWS AI service]
	    B -->|No| D{Need generated language, chat, summarization, embeddings, agents?}
	    D -->|Yes| E{Needs private or current knowledge?}
	    E -->|Yes| F[Amazon Bedrock + RAG / Knowledge Bases]
	    E -->|No| G[Amazon Bedrock prompt, FM, or agent]
	    D -->|No| H{Need custom predictive model?}
	    H -->|Business user/no code| I[SageMaker Canvas]
	    H -->|Code, training, tuning, MLOps| J[Amazon SageMaker]
	    H -->|No| K[Re-check problem; may be analytics, automation, or search]

Amazon Bedrock and generative AI reference

Bedrock conceptWhat it doesExam decision cue
Foundation modelsPretrained models available through a managed serviceChoose based on task, modality, quality, latency, cost, context needs, and customization support
Model inferenceSend prompt/input and receive outputCore operation for chat, generation, summarization, classification, extraction
Embeddings modelConverts content into vectorsRequired for semantic search and many RAG designs
Knowledge Bases for Amazon BedrockManaged RAG workflow over enterprise data sources/vector storesUse when the scenario says ground responses in private documents
Agents for Amazon BedrockOrchestrates tasks and can call tools/APIsUse when the model must take actions or complete multi-step workflows
Guardrails for Amazon BedrockApplies safety, topic, content, and policy controlsUse to reduce unsafe, off-topic, or noncompliant outputs; not a replacement for IAM
Model customizationAdapts supported models using your dataUse when prompt/RAG are insufficient for behavior, domain language, or output style
Model evaluationCompares models and outputs against criteriaUse before selecting or changing models; include quality, safety, latency, and cost
Prompt managementVersion and manage promptsUseful when prompts are application assets requiring repeatability and change control
Inference parametersControl output style and lengthTemperature/top-p affect variability; max tokens affects length/cost/latency

Foundation model selection checklist

FactorAsk this
Task fitIs the model strong for chat, summarization, code, extraction, image, or embeddings?
ModalityDoes it support required inputs and outputs: text, image, audio, document, code?
QualityDoes it meet accuracy, reasoning, grounding, and formatting requirements?
LatencyCan it respond within user or workflow expectations?
Cost efficiencyCan a smaller or more specialized model meet the requirement?
Context handlingCan the model handle the needed prompt plus retrieved context?
SafetyAre guardrails, moderation, and evaluation adequate for the use case?
CustomizationDoes the model support the needed fine-tuning or customization method?
GovernanceCan access, logging, data handling, and compliance expectations be met?

Prompt engineering quick reference

Pattern or parameterUse it forWatch for
Clear task instructionTell the model exactly what to doVague prompts lead to broad or inconsistent output
Role/context“Act as a support analyst…”Role alone does not guarantee correctness
DelimitersSeparate instructions, source text, examples, and output schemaReduces prompt confusion and injection risk
Few-shot examplesShow desired input/output behaviorBad examples teach bad behavior
Output schemaRequest JSON, table, bullet list, or strict fieldsValidate output; models can still produce malformed structures
Grounding contextProvide authoritative content to answer fromAsk model to say when context is insufficient
Step decompositionBreak complex tasks into smaller stepsDo not expose sensitive reasoning if not needed
Low temperatureMore consistent, deterministic-style outputsMay reduce creativity
Higher temperatureBrainstorming or varied creative outputsCan increase inconsistency or hallucination
Top-pControls sampling diversityDo not randomly tune without evaluation
Max tokensLimits response lengthToo low can truncate; too high can increase cost/latency
Stop sequencesEnd generation at desired markersMisconfigured stops can cut off valid output
Negative instructionsTell the model what to avoidPair with positive instructions and validation

Prompt safety traps

TrapMitigation
Putting secrets in promptsDo not send credentials, private keys, or unnecessary sensitive data
Trusting user-provided instructions inside documentsTreat retrieved or uploaded content as untrusted data, not system instructions
Assuming prompt wording is a security controlUse IAM, authorization, validation, logging, and guardrails
Relying on “always answer correctly” wordingUse grounding, citations, evaluation, and human review
Overloading prompts with irrelevant contextRetrieve only relevant chunks and keep prompts focused

RAG: Retrieval-Augmented Generation

RAG is high yield for AIF-C01 because it connects generative AI with enterprise data, embeddings, vector search, and hallucination reduction.

RAG stepWhat happensAWS-oriented cue
IngestCollect documents/dataAmazon S3 and enterprise data sources are common starting points
PrepareClean, split, and chunk contentChunk size affects retrieval quality and prompt cost
EmbedConvert chunks to vectorsUse an embeddings model, often through Amazon Bedrock
Store/indexSave vectors and metadataVector-capable stores or managed knowledge base integrations
RetrieveFind relevant chunks for a querySimilarity search plus filters improves relevance
Augment promptAdd retrieved context to the promptTell model to answer only from provided context when needed
GenerateFoundation model produces responseInclude citations or references when required
EvaluateMeasure grounding and answer qualityTest with real questions and known answers
MonitorTrack failures, drift, latency, costUpdate indexes when source content changes

RAG vs fine-tuning vs prompt-only

TechniqueBest whenNot best when
Prompt-onlyTask is simple; instructions and examples fit in contextNeed large private knowledge base or persistent behavior change
RAGNeed current, proprietary, or citeable factsNeed model to learn a new style deeply or change internal behavior
Fine-tuningNeed consistent task behavior, tone, format, or domain adaptationFacts change frequently; retrieval would be easier
Continued pretrainingNeed broader domain language adaptation for supported modelsSmall task-specific changes or quick iteration
Full custom MLNeed model architecture/control/training pipeline beyond FM application patternsA managed FM app already satisfies the use case

Machine learning problem types

Problem typeGoalExampleCommon metric family
Binary classificationPredict one of two classesFraud/not fraudAccuracy, precision, recall, F1, ROC-AUC
Multiclass classificationPredict one of many classesTicket categoryAccuracy, macro/micro F1
RegressionPredict numeric valueHouse price, demandMAE, MSE, RMSE, R-squared
ClusteringGroup similar items without labelsCustomer segmentsSilhouette, business validation
Anomaly detectionFind unusual observationsSuspicious transactionsPrecision/recall, false positive rate
ForecastingPredict future time-based valuesInventory demandMAE, MAPE, RMSE
RecommendationSuggest relevant itemsProduct recommendationsClick-through, ranking metrics, conversion
Natural language processingUnderstand/process textSentiment, entitiesAccuracy/F1 or task-specific metrics
Computer visionAnalyze images/videoDefect detectionPrecision/recall, IoU for detection
Reinforcement learningLearn actions through rewardsOptimization/controlReward over time, policy performance

Learning approaches

ApproachUses labeled data?OutputExam cue
Supervised learningYesPredict label or valueClassification and regression
Unsupervised learningNoDiscover structureClustering, dimensionality reduction, anomaly discovery
Semi-supervised learningSomeUses small labeled plus large unlabeled dataUseful when labels are expensive
Reinforcement learningUses rewardsPolicy/action strategyAgent learns through interaction
Self-supervised learningLabels derived from data itselfRepresentations/foundation modelsCommon in large-scale pretraining
Transfer learningStarts from pretrained modelAdapts to new taskReduces data/training needs
Active learningModel selects examples needing labelsEfficient labelingUseful when human labels are costly

Amazon SageMaker lifecycle reference

Lifecycle phaseSageMaker/AWS capabilityWhat to know for the exam
Data storageAmazon S3Common storage for datasets, features, artifacts, logs
Data prepAWS Glue, SageMaker Data WranglerClean, transform, and prepare data
LabelingSageMaker Ground TruthHuman labeling workflows for supervised ML
ExplorationSageMaker Studio / notebooksDevelopment environment for data scientists
No-code MLSageMaker CanvasBusiness users can build predictions without coding
Automated MLSageMaker AutopilotAutomatically builds and compares candidate models
TrainingSageMaker training jobsManaged infrastructure for model training
TuningHyperparameter tuningSearches for better model settings
Experiment trackingSageMaker ExperimentsTracks runs, parameters, metrics
Model packagingModel artifacts and containersNeeded before deployment
Registry/governanceSageMaker Model RegistryVersion, approve, and manage models
Real-time inferenceSageMaker endpointsLow-latency online predictions
Batch inferenceSageMaker Batch TransformOffline predictions over batches
Intermittent/serverless inferenceSageMaker Serverless InferenceVariable or unpredictable traffic patterns
Long-running/large async requestsSageMaker Asynchronous InferenceDecoupled request/response workloads
MonitoringSageMaker Model MonitorDetect data quality and drift issues
Bias/explainabilitySageMaker ClarifyAnalyze bias and feature attribution/explainability
PipelinesSageMaker PipelinesAutomate ML workflows and CI/CD-style steps

Evaluation metrics

Confusion matrix terms

TermMeaning
True positiveModel predicted positive, and actual is positive
False positiveModel predicted positive, but actual is negative
True negativeModel predicted negative, and actual is negative
False negativeModel predicted negative, but actual is positive

Classification metric selection

MetricUse whenTrap
AccuracyClasses are balanced and error costs are similarMisleading with imbalanced classes
PrecisionFalse positives are costlyHigh precision can miss many real positives
RecallFalse negatives are costlyHigh recall can increase false positives
F1 scoreNeed balance between precision and recallHides whether precision or recall matters more
ROC-AUCCompare ranking quality across thresholdsCan be less intuitive for severe class imbalance
Confusion matrixNeed error breakdown by classNot a single optimization metric
\[ \begin{aligned} \text{Precision} &= \frac{TP}{TP + FP} \\ \text{Recall} &= \frac{TP}{TP + FN} \\ \text{F1} &= 2 \cdot \frac{\text{Precision} \cdot \text{Recall}}{\text{Precision} + \text{Recall}} \end{aligned} \]

Regression metric selection

MetricUse whenTrap
MAENeed average absolute error in original unitsTreats all errors linearly
MSEWant to penalize larger errors moreSquared units are harder to interpret
RMSEWant original units while penalizing large errorsSensitive to outliers
R-squaredNeed explained variance viewCan be misleading alone
\[ \begin{aligned} \text{MAE} &= \frac{1}{n}\sum_{i=1}^{n} |y_i - \hat{y}_i| \\ \text{MSE} &= \frac{1}{n}\sum_{i=1}^{n} (y_i - \hat{y}_i)^2 \\ \text{RMSE} &= \sqrt{\frac{1}{n}\sum_{i=1}^{n} (y_i - \hat{y}_i)^2} \end{aligned} \]

Generative AI evaluation

Evaluation areaWhat to check
RelevanceDoes the response answer the user’s question?
GroundednessIs the answer supported by provided context or trusted sources?
FactualityAre statements correct?
Hallucination rateHow often unsupported or fabricated claims appear
Toxicity/safetyWhether output violates safety or content policies
Bias/fairnessWhether outputs treat groups unfairly
RobustnessWhether output remains acceptable across prompt variations
Format adherenceWhether JSON, tables, or required fields are valid
LatencyWhether response time meets requirements
Cost efficiencyWhether token/model choices are appropriate
Human preferenceHuman review or pairwise comparison where automated metrics are insufficient

Responsible AI reference

RiskWhat it looks likeMitigation options
BiasUnequal performance across groupsRepresentative data, bias testing, SageMaker Clarify, human review
HallucinationConfident false answerRAG, citations, grounding checks, refusal rules, human approval
Toxic outputHarmful, unsafe, abusive contentAmazon Bedrock Guardrails, moderation, testing, policy controls
Data leakageSensitive data appears in prompts/logs/outputsData classification, redaction, IAM, encryption, logging controls
Prompt injectionUser or document tries to override instructionsSeparate trusted instructions from untrusted content; validate tool calls
OverrelianceUsers accept AI output without judgmentHuman-in-the-loop for high-impact decisions
Lack of explainabilityCannot justify predictionInterpretable models, feature attribution, documentation
Model driftPerformance degrades over timeMonitoring, retraining, alerts, evaluation baselines
Data poisoningTraining/retrieval data is manipulatedSource validation, access controls, lineage, review workflows
Inappropriate useModel used outside intended contextUsage policies, access control, monitoring, user education

Responsible AI principles to connect to scenarios

PrinciplePractical exam interpretation
FairnessTest for disparate impact and biased outcomes
ExplainabilityProvide understandable reasons or feature influence where needed
PrivacyMinimize, protect, and control sensitive data
SafetyPrevent harmful, toxic, or unsafe outputs/actions
TransparencyDocument model purpose, limitations, data, and intended use
RobustnessValidate behavior under varied inputs and edge cases
AccountabilityAssign ownership, approvals, monitoring, and escalation paths
Human oversightKeep humans in control for high-risk or ambiguous decisions

Security and governance for AI workloads on AWS

Control areaAWS services/featuresExam-prep distinction
Identity and authorizationAWS Identity and Access Management, IAM roles, policiesLeast privilege controls who can invoke models, access data, deploy endpoints
Central identityAWS IAM Identity CenterWorkforce access management; not a model evaluation tool
Encryption at restAWS Key Management Service, service encryption featuresProtect datasets, logs, model artifacts, vector stores
Encryption in transitTLS/service endpointsProtect data moving between clients and services
Network isolationVPC design, security groups, private connectivity where supportedReduce public exposure for data and inference paths
SecretsAWS Secrets Manager, AWS Systems Manager Parameter StoreDo not hardcode API keys or database credentials
LoggingAWS CloudTrail, Amazon CloudWatch LogsCloudTrail records API activity; CloudWatch collects metrics/logs/alarms
MonitoringAmazon CloudWatch, SageMaker Model MonitorSystem health plus model/data quality monitoring
Data discoveryAmazon MacieFind and help protect sensitive data in Amazon S3
Threat detectionAmazon GuardDutyDetect suspicious activity; not an AI quality tool
Configuration governanceAWS ConfigTrack resource configuration and compliance rules
Organization guardrailsAWS Organizations, service control policiesRestrict actions across accounts
Data lake governanceAWS Lake FormationFine-grained data lake permissions
AI output controlsAmazon Bedrock GuardrailsSafety/content controls; not a substitute for IAM, encryption, or monitoring

Shared responsibility reminder

AWS is responsible forCustomer is responsible for
Security of AWS infrastructureData classification and lawful/appropriate use
Managed service infrastructure operationsIAM policies, roles, access boundaries
Physical facilities and core cloud platformPrompt content, uploaded data, and retrieval sources
Availability of AWS service controlsConfiguration of logging, encryption, monitoring, and guardrails
Service security featuresEvaluating outputs, handling bias, hallucination, and misuse risks

Data foundations for AI on AWS

Data needAWS serviceWhy it matters for AI
Durable object storageAmazon S3Common location for raw, curated, training, and inference data
Catalog and ETLAWS GlueMakes data discoverable and transformable
Query data in S3Amazon AthenaServerless SQL queries for data exploration
Data warehouseAmazon RedshiftAnalytics at scale; can feed ML and business intelligence
Streaming dataAmazon Kinesis, Amazon MSKReal-time event ingestion for analytics/ML pipelines
Operational relational dataAmazon RDS, Amazon AuroraStructured application data
Key-value/noSQL dataAmazon DynamoDBLow-latency application data and metadata
Search/vector retrievalAmazon OpenSearch ServiceSearch, log analytics, vector similarity
Data governanceAWS Lake Formation, Amazon DataZonePermissions, discovery, sharing, governance
Sensitive data discoveryAmazon MacieHelps identify sensitive data before using it in AI workflows

Deployment and inference patterns

PatternChoose whenAWS cue
Real-time inferenceUser or application needs immediate responseSageMaker endpoint or Bedrock model invocation
Batch inferenceLarge set of predictions can run offlineSageMaker Batch Transform or batch application workflow
Asynchronous inferenceRequests are large or processing takes longerDecoupled request handling with async pattern
Serverless inferenceTraffic is intermittent or unpredictableAvoid managing always-on capacity where supported
Edge inferenceNeed local/low-latency processing near devicesConsider edge-capable deployment patterns
API-backed GenAI appApplication calls model through APICommon with Amazon Bedrock, AWS Lambda, Amazon API Gateway
Human-in-the-loopDecisions need review or labelingSageMaker Ground Truth or workflow approval patterns
MLOps pipelineRepeatable build/test/deploy/monitorSageMaker Pipelines, Model Registry, CI/CD integration

Cost and performance levers

LeverWhy it matters
Choose the right model sizeSmaller/specialized models may be faster and cheaper if quality is sufficient
Reduce unnecessary tokensShorter prompts and outputs can reduce latency and cost
Use retrieval carefullyMore context can improve grounding but increases prompt size
Cache repeated outputs where appropriateReduces repeated inference for identical or stable requests
Batch offline workOften more efficient than real-time calls for noninteractive workloads
Monitor utilizationAvoid idle always-on resources when usage is intermittent
Tune thresholdsClassification threshold changes precision/recall tradeoff without retraining
Evaluate before scalingDo not scale a poor prompt/model/pipeline before measuring quality
Automate cleanupRemove unused notebooks, endpoints, artifacts, and logs according to policy
Track business metricAccuracy alone may not prove business value

High-yield AWS distinctions

DistinctionChoose this whenNot this when
Amazon Bedrock vs SageMakerBedrock for managed foundation model appsSageMaker for full custom ML lifecycle
Amazon Q Business vs BedrockQ Business for ready enterprise assistantBedrock for custom GenAI application/platform control
Amazon Q Developer vs BedrockQ Developer for coding assistanceBedrock for building your own app
Textract vs RekognitionTextract extracts document text/forms/tablesRekognition detects labels, objects, faces, video content
Transcribe vs PollyTranscribe converts speech to textPolly converts text to speech
Translate vs ComprehendTranslate changes languageComprehend analyzes text meaning/entities/sentiment
Kendra vs OpenSearchKendra for managed enterprise search relevanceOpenSearch for search infrastructure, logs, vector search control
RAG vs fine-tuningRAG for current/private factual knowledgeFine-tuning for behavior, style, or task adaptation
Guardrails vs IAMGuardrails influence model content/safetyIAM controls who can access what
CloudWatch vs CloudTrailCloudWatch monitors metrics/logs/alarmsCloudTrail records API activity
Model Monitor vs ClarifyModel Monitor detects drift/data quality issuesClarify helps with bias and explainability
Accuracy vs F1Accuracy for balanced classesF1 for precision/recall balance, especially imbalance
Precision vs recallPrecision reduces false positivesRecall reduces false negatives

Common AIF-C01 scenario cues

If the question says…Think…
“Summarize internal policy documents accurately”Amazon Bedrock with RAG/Knowledge Bases; evaluate groundedness
“Reduce hallucinations using company documents”RAG, citations, retrieval quality, guardrails, human review
“Build a chatbot with intents and slots”Amazon Lex
“Convert call recordings to text”Amazon Transcribe
“Generate spoken audio from text”Amazon Polly
“Extract fields from invoices or forms”Amazon Textract
“Detect objects or unsafe image content”Amazon Rekognition
“Analyze sentiment and entities in text”Amazon Comprehend
“Business user wants no-code predictions”SageMaker Canvas
“Data scientist needs training jobs and model endpoints”Amazon SageMaker
“Need human labelers for training data”SageMaker Ground Truth
“Need to compare model versions and approve deployment”SageMaker Model Registry
“Need to detect drift after deployment”SageMaker Model Monitor
“Need to analyze bias or feature attribution”SageMaker Clarify
“Need a workplace assistant over enterprise data”Amazon Q Business
“Need code suggestions in IDE”Amazon Q Developer
“Need log/metric alarms”Amazon CloudWatch
“Need record of who called what API”AWS CloudTrail
“Need find sensitive data in S3”Amazon Macie
“Need restrict accounts from using certain actions”AWS Organizations service control policies

Data quality and model quality traps

TrapWhy it matters
Training/test leakageInflates evaluation; model may fail in production
Imbalanced classesAccuracy can look high while minority class performance is poor
Nonrepresentative dataModel performs poorly for real users or future conditions
Label noiseSupervised model learns incorrect patterns
Missing valuesCan bias results or break pipelines if untreated
OutliersCan distort regression and distance-based models
Correlation mistaken for causationML predictions do not automatically prove cause
Optimizing only technical metricsBusiness outcome, risk, fairness, and cost may be unacceptable
Ignoring threshold selectionSame model can behave very differently at different classification thresholds
No monitoring after deploymentDrift, data changes, and degradation go unnoticed

Quick pre-exam checklist

  • Can you distinguish Amazon Bedrock, Amazon SageMaker, Amazon Q, and specialized AWS AI services?
  • Can you choose between prompt engineering, RAG, fine-tuning, and custom ML?
  • Can you match metrics to classification, regression, and generative AI evaluation?
  • Can you explain precision vs recall and when false positives or false negatives matter?
  • Can you identify responsible AI controls for bias, hallucination, toxicity, privacy, and explainability?
  • Can you separate IAM/security controls from model safety guardrails?
  • Can you identify the role of CloudWatch, CloudTrail, KMS, Macie, Lake Formation, and AWS Organizations in AI governance?
  • Can you recognize when a scenario needs human review, data labeling, model monitoring, or retraining?

Practical next step

Use this Quick Reference to drill scenario questions: for each prompt, first identify the problem type, then choose the AWS service, then name the key security, governance, evaluation, or responsible AI control that makes the answer complete.

Browse Certification Practice Tests by Exam Family