MLA-C01 — AWS Certified Machine Learning Engineer – Associate Quick Reference

Compact AWS MLA-C01 reference for machine learning engineering: data prep, SageMaker training, deployment, MLOps, monitoring, and security decisions.

Exam identity and quick-use map

This independent Quick Reference supports preparation for AWS Certified Machine Learning Engineer – Associate (MLA-C01). It focuses on the AWS service choices, ML engineering workflows, security controls, deployment patterns, and troubleshooting distinctions that commonly drive scenario questions.

Use this page to answer: What AWS service or pattern should I choose, and why?

    flowchart LR
	    A[Data sources] --> B[Ingest and store]
	    B --> C[Clean, label, transform]
	    C --> D[Feature engineering]
	    D --> E[Train or tune model]
	    E --> F[Evaluate]
	    F --> G{Meets criteria?}
	    G -- No --> C
	    G -- Yes --> H[Register and approve]
	    H --> I[Deploy: real-time, async, batch, serverless]
	    I --> J[Monitor: data, quality, bias, latency, errors]
	    J --> K{Drift or degradation?}
	    K -- Yes --> L[Retrain pipeline]
	    L --> E
	    K -- No --> I

High-yield AWS service selection

Need in scenarioPreferWhy it fitsCommon trap
Durable landing zone for training data, artifacts, model outputsAmazon S3Native integration with SageMaker, Glue, Athena, EMR, Redshift SpectrumDo not store large training datasets only on notebook instance storage
Data catalog for files in S3AWS Glue Data CatalogCentral schema/catalog for Athena, Glue, EMR, Redshift SpectrumAthena queries data; Glue Data Catalog stores metadata
Serverless SQL over S3Amazon AthenaAd hoc queries without managing clustersNot ideal for heavy ETL pipelines that need complex transforms
Serverless ETL, crawlers, Spark jobsAWS GlueManaged ETL and schema discoveryUse EMR when cluster-level control/custom big data stack is required
Custom big data processing frameworksAmazon EMRManaged Hadoop/Spark/Hive ecosystem with more configuration controlMore operational responsibility than Glue
Data warehouse analyticsAmazon RedshiftColumnar analytics, BI, warehouse workloadsS3 + Athena is often enough for ad hoc lake queries
Streaming ingestion with custom consumersAmazon Kinesis Data StreamsLow-latency streams and multiple consuming appsNot the same as Firehose delivery
Managed streaming delivery to S3/Redshift/OpenSearchAmazon Data FirehoseMinimal administration for delivery and bufferingLess control than Kinesis Data Streams
Kafka-compatible streamingAmazon MSKManaged Apache Kafka compatibilityChoose only when Kafka ecosystem compatibility matters
Human data labelingAmazon SageMaker Ground TruthManaged labeling workflows and workforcesFor sensitive data, prefer private workforce controls
Reusable online/offline ML featuresAmazon SageMaker Feature StoreHelps reduce training-serving skewDo not duplicate feature logic in separate train and inference code
No-code/low-code ML explorationAmazon SageMaker CanvasBusiness-user model building and predictionsProduction-grade MLOps still needs controlled pipelines and deployment
Managed notebook and ML IDEAmazon SageMaker StudioDevelopment, experiments, pipelines, model registry integrationNotebook success does not equal reproducible pipeline
Managed training jobsAmazon SageMaker TrainingScalable, repeatable training with containers, S3 inputs, IAM rolesAvoid training on notebook instances for production workflows
Hyperparameter searchSageMaker automatic model tuningRuns multiple training jobs against objective metricDo not tune against final test set
ML workflow orchestrationAmazon SageMaker PipelinesML-native steps, lineage, parameters, model registry integrationUse Step Functions for broader cross-service workflow orchestration
General workflow orchestration across AWS servicesAWS Step FunctionsServerless state machines, retries, approvals, integrationsLess ML-specific lineage than SageMaker Pipelines
Model package approval and versioningSageMaker Model RegistryTracks model versions, metadata, approval stateS3 artifact alone is not governed deployment
Real-time hosted inferenceSageMaker real-time endpointPersistent low-latency API endpointIdle endpoints can create unnecessary cost
Offline scoring of large datasetsSageMaker Batch TransformNo persistent endpoint; reads/writes S3Not for interactive request/response inference
Large payloads or longer inference timesSageMaker Asynchronous InferenceQueued requests, S3 outputs, scales endpoint capacityNot true synchronous low-latency API behavior
Intermittent inference trafficSageMaker Serverless InferenceNo instance management for spiky/idle workloadsConsider cold starts and workload suitability
Many similar models behind one endpointSageMaker multi-model endpointConsolidates model hostingModel load/cache behavior can affect latency
Foundation model API without managing model infrastructureAmazon BedrockManaged access to foundation models, agents, guardrails, knowledge basesDo not choose custom SageMaker training when managed FM API is enough
Custom ML containersAmazon ECR + SageMakerBring your own algorithm or inference containerContainer must satisfy SageMaker training/inference contracts
Logs, metrics, alarmsAmazon CloudWatchOperational visibility for endpoints, training, pipelinesModel quality drift requires ML-specific monitoring too
API auditingAWS CloudTrailWho called what AWS API and whenCloudWatch logs are not a substitute for API audit trails
Sensitive data discovery in S3Amazon MacieFinds and reports sensitive dataMacie does not replace IAM, KMS, or data access design

Data engineering and preparation reference

Storage, catalog, and query decisions

PatternBest fitExam cues
Raw/bronze data lakeS3 buckets with prefixes, encryption, lifecycle policies“Store raw source data durably and cheaply”
Curated training datasetS3 curated prefix, Parquet/CSV/RecordIO as appropriate“Reusable prepared dataset for training jobs”
Schema discoveryGlue crawler + Glue Data Catalog“Infer schema from files in S3”
SQL explorationAthena“Run SQL directly on S3 data”
Repeatable ETLGlue ETL job or SageMaker ProcessingGlue for general ETL; SageMaker Processing when tightly coupled to ML workflow
Distributed feature engineeringGlue, EMR, or SageMaker Processing with SparkChoose based on required control and integration
Warehouse-to-ML sourceRedshift unload/query integration, Data Wrangler, or direct connector“Training from warehouse data”
Streaming features/eventsKinesis Data Streams, MSK, Data FirehoseDistinguish custom stream processing from managed delivery

Data split and leakage traps

SituationSplit strategyWatch for
Independent and identically distributed tabular dataRandom train/validation/test splitFit preprocessing only on training data
Imbalanced classesStratified splitAccuracy may be misleading
Time series forecastingChronological splitRandom split leaks future information
Same user/device/account appears many timesGroup-aware splitAvoid same entity in train and test
Small datasetCross-validation if feasibleKeep final holdout untouched
Hyperparameter tuningTrain/validation or cross-validationTest set is for final estimate only
Feature engineering before splitUsually unsafeScaling, imputation, encoding, and feature selection can leak test statistics

Data quality checklist

CheckWhy it mattersAWS-oriented action
Missing valuesMany algorithms cannot use nulls directlyImpute, add missing indicator, or filter
OutliersCan dominate loss and scalingCap, transform, robust scaling, investigate source
Class imbalanceOptimizer may favor majority classResampling, class weights, threshold tuning, PR AUC/F1
Label noiseLimits achievable accuracyGround Truth review, consensus labeling, quality audits
Duplicate rowsCan leak across splitsDeduplicate before splitting or group split
Skewed distributionsAffects linear models and distance methodsLog transform, normalization, robust scaling
High cardinality categoricalsSparse and overfit-proneHashing, target encoding with care, embeddings
PII/sensitive dataSecurity and governance riskMacie, IAM least privilege, KMS, tokenization/redaction

Feature Store concepts

ConceptMeaningExam relevance
Feature groupNamed collection of feature definitions and recordsOrganizes reusable features
Offline storeHistorical features, typically in S3Training, batch analytics, backfills
Online storeLow-latency feature lookupReal-time inference
Event timeTimestamp associated with feature recordCorrect point-in-time training data
Training-serving skewDifferent feature logic or freshness between training and productionFeature Store and shared transformation code reduce risk

SageMaker development and training

Development environment choices

NeedChooseNotes
Full ML IDE and managed notebooksSageMaker StudioUseful for experiments, pipelines, registry, monitoring
Notebook-only experimentationSageMaker notebook instances or Studio notebooksStop idle resources; not a production pipeline by itself
Business-user model buildingSageMaker CanvasLow-code predictions and exploration
Visual data prepSageMaker Data Wrangler where available in the workflowUseful for profiling, transforms, export to jobs/pipelines
Scripted reproducible processingSageMaker ProcessingRun preprocessing/evaluation containers at scale
Production trainingSageMaker Training jobIsolated, repeatable, containerized, logged

Training job anatomy

Recognize these knobs in scenario and configuration questions:

TrainingJob:
  AlgorithmSpecification:
    TrainingImage: <ECR image or built-in algorithm>
    TrainingInputMode: File | FastFile | Pipe
  RoleArn: <SageMaker execution role>
  InputDataConfig:
    - ChannelName: train
      DataSource: s3://bucket/prefix/train/
    - ChannelName: validation
      DataSource: s3://bucket/prefix/validation/
  OutputDataConfig:
    S3OutputPath: s3://bucket/prefix/model-artifacts/
    KmsKeyId: <optional KMS key>
  ResourceConfig:
    InstanceType: <training instance type>
    InstanceCount: <count>
    VolumeKmsKeyId: <optional KMS key>
  HyperParameters:
    objective: binary:logistic
  VpcConfig:
    Subnets: [private-subnet]
    SecurityGroupIds: [sg-id]
  StoppingCondition:
    MaxRuntimeInSeconds: <limit>

Training input modes and data access

Mode/sourceBest fitTrap
File modeCommon default; data copied from S3 to training volumeStartup can be slower for very large data
FastFile modeS3 data exposed with file-like access where supportedConfirm algorithm/framework support
Pipe modeStreams data to algorithm where supportedContainer/algorithm must support streaming
Amazon FSx for LustreHigh-performance distributed file accessMore setup than simple S3 inputs
Amazon EFSShared file system across instancesConsider throughput and access pattern
Checkpoints to S3Long or interruptible training jobsNeeded to resume rather than restart from scratch

Container and algorithm choices

ChoiceUse whenNotes
Built-in SageMaker algorithmStandard algorithm fits problemLess container work, optimized integration
Framework estimator/script modeTensorFlow, PyTorch, scikit-learn, XGBoost scriptsBring training script; SageMaker manages job
Custom Docker containerNeed custom runtime, dependencies, algorithm, or inference stackMust follow SageMaker container conventions
Bring your own model artifactModel already trained elsewherePackage with compatible inference container
Amazon ECR imageCustom training/inference imageExecution role needs pull permissions

Built-in algorithm selection cues

Problem cueLikely algorithm familyNotes
Tabular classification/regression with nonlinear patternsXGBoostStrong default for structured data
Large-scale linear classification/regressionLinear LearnerWorks well for sparse/high-dimensional linear problems
Recommendation or sparse feature interactionsFactorization MachinesCommon for user-item sparse matrices
Clustering without labelsK-MeansUnsupervised segmentation
Anomaly detection in numeric/time-series-like dataRandom Cut ForestDetects unusual observations
Text classification or word embeddingsBlazingTextText-focused built-in option
Forecasting multiple related time seriesDeepARUses historical time series patterns
Image classification/detectionImage Classification, Object Detection, or framework modelOften use transfer learning or pretrained models
Custom deep learning architecturePyTorch/TensorFlow on SageMakerUse framework estimator or custom container

Hyperparameter tuning

ElementWhat to know
Objective metricMetric to maximize or minimize; must be emitted by training job
Search spaceRanges or categorical values for hyperparameters
Early stoppingStops weak jobs when supported/appropriate
Validation setUsed to compare tuning jobs
Final test setHeld out until final evaluation
Overfitting riskMore tuning can overfit validation data

Model evaluation metrics

Confusion matrix terms

TermMeaning
TPPredicted positive and actually positive
FPPredicted positive but actually negative
TNPredicted negative and actually negative
FNPredicted negative but actually positive
\[ \begin{aligned} Accuracy &= \frac{TP + TN}{TP + TN + FP + FN} \\ Precision &= \frac{TP}{TP + FP} \\ Recall &= \frac{TP}{TP + FN} \\ F1 &= 2 \cdot \frac{Precision \cdot Recall}{Precision + Recall} \end{aligned} \]

Metric selection table

Task or riskPreferAvoid over-relying on
Balanced classificationAccuracy, ROC AUC, F1Accuracy alone if costs differ
Rare positive classPrecision, recall, F1, PR AUCAccuracy and sometimes ROC AUC
False negatives are costlyRecall/sensitivityPrecision alone
False positives are costlyPrecisionRecall alone
Probabilistic classificationLog loss, calibrationOnly thresholded accuracy
Regression with large-error penaltyRMSEMAE if large errors must be emphasized
Regression with robust typical errorMAERMSE if outliers dominate unfairly
ForecastingMAE, RMSE, MAPE/sMAPE where validMAPE when actual values can be zero
Ranking/recommendationNDCG, MAP, precision@k/recall@kGeneric classification accuracy
ClusteringSilhouette score, within-cluster sum of squaresSupervised metrics without labels
\[ RMSE = \sqrt{\frac{1}{n}\sum_{i=1}^{n}(y_i - \hat{y}_i)^2} \]\[ MAE = \frac{1}{n}\sum_{i=1}^{n}|y_i - \hat{y}_i| \]

Evaluation traps

TrapCorrect reasoning
“High accuracy” on imbalanced dataCheck confusion matrix, recall, precision, F1, PR AUC
Tuning threshold on test setTune threshold on validation set; reserve test set
Random split for time seriesUse chronological split
Preprocessing entire dataset before splitFit transforms on train only, apply to validation/test
Comparing models with different test dataUse the same holdout or controlled cross-validation
Better offline metric but worse productionInvestigate data drift, training-serving skew, latency/timeouts, feature freshness

Deployment and inference patterns

Inference mode decision matrix

RequirementChooseWhyWatch for
Low-latency request/responseSageMaker real-time endpointPersistent HTTPS endpointScale and monitor latency/errors
Spiky or intermittent trafficSageMaker Serverless InferenceNo instance managementCold start and workload suitability
Large payloads or long processingSageMaker Asynchronous InferenceQueued async invocation, S3 outputClient does not wait synchronously
Offline batch scoringSageMaker Batch TransformReads S3 input, writes S3 outputNo always-on endpoint
Many tenant- or segment-specific modelsMulti-model endpointHosts multiple models behind one endpointInitial model load can add latency
Multiple containers in one endpointMulti-container endpointDirect or serial container invocation patternsNot the same as multi-model hosting
Edge or disconnected inferenceAWS IoT Greengrass or device runtime patternLocal inference near data sourceModel update and device security matter
Lightweight model behind app APIAWS Lambda plus API Gateway, if suitableSimple serverless app integrationNot ideal for large models/heavy inference

Deployment controls

ControlUse forNotes
Production variantsTraffic splitting across model variantsSupports A/B style testing
Shadow variantTest new model on production traffic without serving its responseUseful before promotion
Canary/linear rollout patternGradual production traffic shiftPair with CloudWatch alarms and rollback
Auto scalingAdjust endpoint capacity based on demandMonitor latency, invocation volume, errors
Data captureStore inference inputs/outputs in S3Required for many monitoring workflows
Model Registry approvalGate promotion to staging/prodSupports governance and reproducibility
Inference RecommenderEvaluate hosting instance/config optionsUse when unsure about performance/cost tradeoff

SageMaker inference container contract

EndpointPurpose
/pingHealth check
/invocationsInference requests

Common container issues: wrong content type, missing dependencies, slow model load, model artifact path mismatch, container not listening correctly, memory exhaustion, or IAM denial when pulling ECR image or reading S3 artifact.

Generative AI and foundation model choices

ScenarioPreferReasoning
Use managed foundation models through APIAmazon BedrockAvoids managing model infrastructure
Need guardrails for FM application behaviorGuardrails for Amazon BedrockCentral control for safety and policy behavior
Need RAG over enterprise documentsKnowledge Bases for Amazon Bedrock or custom RAG stackRetrieves current private context instead of retraining model for facts
Need agents that call tools/APIsAgents for Amazon BedrockOrchestrates tasks with FM reasoning and actions
Need deploy/tune open or pretrained model in SageMaker environmentSageMaker JumpStart or SageMaker hostingMore control over model/container/VPC/MLOps
Need custom model architecture/training loopSageMaker custom trainingFull control, more engineering responsibility
Need semantic searchEmbeddings + vector store such as Amazon OpenSearch Service/OpenSearch Serverless, Aurora PostgreSQL with vector support, or managed Bedrock knowledge baseMatch text by meaning, not exact keywords

Prompt, RAG, fine-tuning, or training?

NeedUsually chooseWhy
Change output format, tone, instructionsPrompt engineeringFastest and lowest operational complexity
Use private or frequently changing factsRAGKeeps knowledge external and updateable
Improve behavior on repeated task patternFine-tuning/customization where supportedTeaches task style or domain pattern
Add brand-new domain facts onlyRAG firstFine-tuning is not a reliable database
Build specialized model from scratchCustom trainingHighest cost/complexity; use only when necessary

MLOps and automation

Pipeline stages to recognize

StageSageMaker/AWS service fitKey artifacts
IngestS3, Kinesis, Data Firehose, DMSRaw data
Validate/profileGlue Data Quality, SageMaker Processing, Data WranglerData reports, constraints
TransformGlue, EMR, SageMaker ProcessingCurated dataset, features
TrainSageMaker TrainingModel artifact, metrics
TuneSageMaker automatic model tuningBest training job, hyperparameters
EvaluateSageMaker Processing or pipeline evaluation stepEvaluation report
Conditional gateSageMaker Pipelines condition stepPass/fail metric rule
RegisterSageMaker Model RegistryModel package/version
ApproveManual or automated approval workflowApproval state
DeploySageMaker endpoint, Batch Transform, CI/CD pipelineEndpoint or batch job
MonitorModel Monitor, Clarify, CloudWatchDrift reports, alarms
RetrainEventBridge, Pipelines, Step FunctionsNew model version

SageMaker Pipelines pattern

Process raw data
  -> Train model
  -> Evaluate metrics
  -> If metric passes threshold:
         Register model package
         Optionally deploy to staging
     Else:
         Stop and record failure

CI/CD and governance distinctions

NeedPreferNotes
Version infrastructureAWS CloudFormation or AWS CDKReproducible environments
Build/test custom containersAWS CodeBuild + Amazon ECRScan and control images
Orchestrate release stagesAWS CodePipeline or equivalent CI/CDSeparate dev/test/prod
Trigger pipeline on data or approval eventAmazon EventBridgeEvent-driven retraining/deployment
Human approvalCodePipeline approval, Step Functions, or registry approval processUseful before production changes
Track experimentsSageMaker ExperimentsParameters, metrics, artifacts, lineage
Reproduce trainingPin code, image, dependencies, data version, hyperparameters, random seedsNot just “rerun notebook”

Security, privacy, and governance

IAM and access patterns

ControlExam-ready meaning
SageMaker execution roleRole assumed by SageMaker jobs/endpoints to access S3, ECR, CloudWatch, KMS, VPC resources
Least privilegeRestrict actions and resource ARNs, especially S3 prefixes and KMS keys
IAM user/role separationHuman identity starts jobs; execution role is used by managed service
Resource policiesS3 bucket policies, KMS key policies, ECR repository policies may also be required
Temporary credentialsPrefer IAM roles over long-lived static keys
Secrets ManagerStore database/API credentials; do not hardcode in notebooks or containers

Network and encryption controls

RequirementUseNotes
Encrypt S3 training data/artifactsS3 server-side encryption with AWS KMS where requiredExecution role needs KMS permissions
Encrypt training/inference volumesKMS key options where supportedInclude key policy permissions
Private training/inference network pathVPC configuration with private subnets/security groupsEnsure access to S3/ECR/CloudWatch through endpoints or controlled egress
No internet access from training containerNetwork isolation where appropriateContainer cannot fetch packages from internet
Private AWS service accessVPC endpoints/AWS PrivateLink where supportedAvoid public internet routes
Audit API callsCloudTrailWho changed endpoint, role, pipeline, bucket, key
Monitor logs/metricsCloudWatchOperational visibility
Detect sensitive data in S3MacieComplements, not replaces, access controls
Govern data lake permissionsAWS Lake FormationCentralized lake permissions over cataloged data

Security traps

TrapCorrect answer direction
AccessDenied from training job despite user accessCheck SageMaker execution role, bucket policy, KMS key policy
Private subnet job cannot pull image or read S3Add required VPC endpoints or controlled NAT path
KMS-encrypted S3 object unreadableExecution role needs both S3 and KMS decrypt permissions
Secret passed as plain environment variableUse Secrets Manager or secure parameter retrieval
Public notebook or endpoint exposureUse IAM, VPC, security groups, private access, and least privilege
Sensitive labeling dataUse private workforce and secure data access controls

Monitoring, observability, and troubleshooting

What to monitor

LayerTool/serviceSignals
Endpoint operationsCloudWatch metrics/logsInvocations, latency, errors, resource utilization
Training jobsCloudWatch logs, SageMaker job statusScript errors, metric output, resource failures
API activityCloudTrailCreate/update/delete endpoint, IAM, S3, KMS API calls
Input/output driftSageMaker Model Monitor data qualityFeature distribution changes
Model performanceSageMaker Model Monitor model qualityRequires ground truth labels
Bias driftSageMaker Clarify / Model Monitor integrationBias metric changes over time
Explainability driftClarify feature attribution monitoringFeature importance changes
Data captureSageMaker endpoint data capture to S3Inputs/outputs for monitoring and analysis

Troubleshooting decision table

SymptomLikely checks
Training job cannot access dataExecution role, S3 URI, bucket policy, KMS key policy, VPC endpoint
Training job starts but algorithm failsInput format, content type, channel names, hyperparameters, script error
Metrics not visible for tuningTraining script must emit metric matching tuning regex/definition
Endpoint creation failsModel artifact path, container image, IAM/ECR access, model load errors
Endpoint returns 4xxRequest format, content type, authentication, payload schema
Endpoint returns 5xxContainer logs, model exception, memory/timeout, dependency error
Latency increasesInstance sizing, concurrency, autoscaling, payload size, cold starts, model size
Production accuracy dropsData drift, label drift, feature skew, upstream schema change, stale features
Model Monitor has no quality reportGround truth labels may be missing or delayed
Costs unexpectedly highIdle endpoints/notebooks, overprovisioned instances, unnecessary always-on hosting
Batch job slowInput sharding, data format, instance choice, transform strategy
Pipeline did not triggerEventBridge rule, permissions, source event pattern, pipeline parameters

Cost-aware engineering choices

Cost pressurePractical pattern
Idle development environmentsStop notebooks/Studio apps when unused; use lifecycle controls where appropriate
Always-on endpoint with rare trafficConsider Serverless Inference, Asynchronous Inference, or Batch Transform
Many small modelsConsider multi-model endpoints
Large recurring batch scoringUse Batch Transform and right-size compute
Long training jobsUse checkpoints; consider managed spot training where suitable
OvertrainingUse early stopping and sensible tuning search spaces
Duplicate feature computationReuse Feature Store and shared processing jobs
Unused artifacts/logsApply S3 lifecycle policies and retention controls
Inefficient data formatPrefer columnar/compressed formats such as Parquet for analytics workloads

Scenario shortcuts

If the stem says…Likely answerWhy
“Run SQL on files in S3 without managing servers”Athena + Glue Data CatalogServerless query over data lake
“Infer schema from new S3 data”Glue crawlerPopulates catalog metadata
“Large-scale ETL with serverless Spark”AWS GlueManaged ETL
“Need full Spark cluster configuration control”EMRMore control than Glue
“Label images with human reviewers”SageMaker Ground TruthManaged labeling
“Avoid different feature code in training and inference”SageMaker Feature StoreReduces training-serving skew
“Train model reproducibly at scale”SageMaker Training jobManaged, containerized, repeatable
“Find best hyperparameters automatically”SageMaker automatic model tuningSearches parameter space
“Track parameters, metrics, and artifacts”SageMaker ExperimentsExperiment lineage
“Approve model before production”SageMaker Model RegistryModel package governance
“Deploy for millisecond-style request/response”Real-time endpointPersistent inference
“Score millions of records nightly”Batch TransformOffline batch predictions
“Requests can take longer and response can be stored in S3”Asynchronous InferenceQueued async processing
“Traffic is unpredictable and often idle”Serverless InferenceNo instance management
“Compare new model silently on production traffic”Shadow variantDoes not affect user response
“Detect input feature distribution drift”Model Monitor data qualityBaseline vs captured data
“Detect accuracy degradation after labels arrive”Model Monitor model qualityNeeds ground truth
“Who changed the endpoint configuration?”CloudTrailAPI audit
“Endpoint has high 5xx errors”CloudWatch logs + container diagnosticsOperational troubleshooting
“Use foundation model without hosting it”Amazon BedrockManaged FM API
“Add current company documents to FM answers”RAG / Knowledge Bases for Amazon BedrockRetrieves external knowledge
“Sensitive S3 training data may contain PII”Macie + IAM/KMS controlsDiscovery plus protection
“Private training with no internet”VPC config, endpoints, network isolationControlled network path

Final review checklist

  • Map every scenario to the lifecycle step: data, features, training, evaluation, deployment, monitoring, or governance.
  • Distinguish Athena vs Glue vs EMR, Pipelines vs Step Functions, and real-time vs async vs batch vs serverless inference.
  • For security questions, check execution role, S3 policy, KMS policy, VPC path, and CloudTrail.
  • For model quality questions, identify whether the issue is data quality, drift, bias, feature skew, evaluation metric choice, or deployment configuration.
  • For MLOps questions, prefer repeatable jobs, tracked artifacts, model registry approval, automated deployment, and monitoring-triggered retraining over manual notebook workflows.

Next step: use this Quick Reference as a drill sheet, then practice scenario questions that force you to choose the correct AWS service, deployment mode, monitoring control, or security fix for MLA-C01.

Browse Certification Practice Tests by Exam Family