SOA-C03 — AWS Certified CloudOps Engineer – Associate Quick Reference

Compact AWS SOA-C03 quick reference for service selection, monitoring, automation, security, networking, reliability, and troubleshooting.

Exam-use orientation

This independent Quick Reference supports preparation for the AWS Certified CloudOps Engineer – Associate (SOA-C03) exam from AWS. Use it as a scenario decision guide: the exam often tests which AWS service, operational control, or troubleshooting step best fits a production operations problem.

CloudOps thinking pattern

Question asks about…First decide…Then choose based on…
MonitoringMetric, log, trace, event, or audit record?CloudWatch, X-Ray, EventBridge, CloudTrail, AWS Config
AutomationOne-time command, recurring desired state, patching, or workflow?AWS Systems Manager capability or AWS CloudFormation
Change managementInfrastructure template, app deployment, or instance replacement?CloudFormation, CodeDeploy, Auto Scaling instance refresh
ReliabilityHA in one Region or DR across Regions?Multi-AZ, backups, replication, Route 53 failover
SecurityIdentity, encryption, detection, or compliance evidence?IAM, KMS, CloudTrail, Config, GuardDuty, Security Hub
NetworkingRouting, DNS, firewalling, private access, or edge delivery?VPC route tables, Route 53, security groups/NACLs, VPC endpoints, CloudFront
Cost/performanceRightsizing, purchasing, data transfer, or storage tiering?Compute Optimizer, Cost Explorer, Budgets, lifecycle policies

Exam habit: eliminate answers that are manually operated, not highly available, not least privilege, or do not produce auditable operational evidence.

Core AWS operations service-selection matrix

Operational needPreferWhyCommon trap
Audit who called AWS APIsAWS CloudTrailRecords management events and optional data eventsCloudWatch Logs show app/system logs, not complete API audit history
Detect resource configuration drift/complianceAWS ConfigTracks resource configuration history and evaluates rulesCloudTrail tells who changed something, not whether current state is compliant
Alarm on metric thresholdAmazon CloudWatch alarmNative metric evaluation and actionsEventBridge is for event patterns, not continuous metric evaluation
Route AWS service events to targetsAmazon EventBridgeEvent bus, rules, schedules, SaaS/custom eventsCloudWatch alarm actions are limited to alarm state transitions
Centralize application logsCloudWatch LogsLog groups, retention, metric filters, Logs InsightsCloudTrail is not an application log platform
Run commands on managed instancesSystems Manager Run CommandRemote command execution without inbound SSH/RDPRequires SSM Agent, IAM role, and network path to Systems Manager endpoints
Enforce recurring instance configurationSystems Manager State ManagerMaintains desired state associationsRun Command is better for ad hoc execution
Patch EC2 or hybrid nodesSystems Manager Patch ManagerBaselines, maintenance windows, patch complianceUser data is not patch management
Secure shell access without opening portsSystems Manager Session ManagerAuditable sessions through SSMStill requires IAM permissions and managed instance connectivity
Automate operational runbookSystems Manager AutomationStep-based remediation workflowsLambda is useful for code, but Automation has runbook-native actions
Provision infrastructure as codeAWS CloudFormationDeclarative stacks, change sets, drift detectionCLI-created resources are harder to audit and reproduce
Deploy application revisionsAWS CodeDeployIn-place/blue-green deployment strategiesCloudFormation manages infrastructure; CodeDeploy manages app rollout
Replace Auto Scaling instances safelyEC2 Auto Scaling instance refreshGradual replacement using launch template/config changesUpdating the launch template alone does not replace existing instances
Central backup policyAWS BackupCross-service backup plans and vaultsSnapshots alone do not provide centralized policy/compliance views
Private access to AWS servicesVPC endpointsAvoid public internet paths for supported servicesNAT gateway provides outbound internet, not private service access
Edge caching and TLS termination near usersAmazon CloudFrontGlobal CDN, caching, origin protection optionsRoute 53 does DNS routing; it does not cache content
Detect suspicious account or workload activityAmazon GuardDutyThreat detection from logs and signalsSecurity Hub aggregates findings; it is not the primary detector
Aggregate security postureAWS Security HubConsolidates findings and standards checksConfig rules are resource compliance checks, not a findings hub
Discover sensitive data in S3Amazon MacieS3 data discovery and classificationS3 Inventory lists objects; it does not classify sensitive content
Analyze IAM external accessIAM Access AnalyzerIdentifies resource policies allowing external accessIAM credential report is about users/passwords/keys
Store database credentials with rotationAWS Secrets ManagerManaged secret lifecycle and rotation integrationParameter Store can store secrets, but rotation is not the same feature set
Store config parametersSystems Manager Parameter StoreHierarchical config values, optional encryptionDo not hard-code config in AMIs or user data
Manage encryption keysAWS KMSKey policies, grants, envelope encryption integrationIAM permission alone may not be enough; key policy must allow use
View AWS account health eventsAWS HealthService and account-specific operational eventsCloudWatch health checks monitor endpoints, not AWS account advisories
Govern accounts at scaleAWS OrganizationsConsolidated management, SCP guardrailsSCPs limit permissions; they do not grant permissions

Monitoring, logging, and event response

Observability services: high-yield distinctions

Service/featureBest forExam cuesNot best for
CloudWatch metricsNumeric time-series performance and healthCPU, latency, errors, queue depth, custom app metricFull audit trail or config history
CloudWatch alarmsThreshold, anomaly, or metric math alarm actionsNotify, scale, recover, stop, route incidentComplex event enrichment
CloudWatch LogsCentral log storage and searchApplication logs, OS logs, Lambda logs, VPC flow logs destinationLong-term object archive unless exported/archived
CloudWatch Logs InsightsAd hoc log query and troubleshootingFilter errors, aggregate by field, recent incident analysisPermanent business analytics warehouse
CloudWatch metric filtersTurn log patterns into metricsCount “ERROR” strings, unauthorized attemptsFree-form historical log analytics
CloudWatch AgentOS/process/custom metrics and logs from EC2/on-premMemory, disk, swap, app logsNative AWS service metrics that already exist
EventBridgeMatch events and route to targetsEC2 state change, scheduled automation, SaaS/custom busContinuous metric thresholding
CloudTrailAPI activity auditWho changed security group? Who deleted object?Instance CPU/memory monitoring
AWS ConfigResource inventory, config history, complianceIs S3 public access blocked? Has SG changed?User login/session troubleshooting
X-RayDistributed tracingService map, trace latency, segment errorsInfrastructure patch compliance

CloudWatch alarm decision points

DecisionChoose this when…Notes
Standard metric alarmOne metric or metric math expression is enoughMost common alarm scenario
Composite alarmNeed to reduce noise by combining alarm statesUse when multiple symptoms must be true before paging
Anomaly detectionNormal baseline varies over timeGood for cyclical traffic patterns
Treat missing data as breachingMissing metric is itself a failureUseful for heartbeat/custom metrics
Treat missing data as not breachingSilence can be normalAvoid false alarms for sparse metrics
Metric mathNeed derived signalExample: error percentage from errors and requests
Detailed monitoring/custom metricsNeed more granular or non-default dataMemory and disk require agent/custom metrics on EC2
Alarm action to Auto ScalingNeed scaling responseScaling policy should align with application behavior
Alarm action to SNS/EventBridge/Incident ManagerNeed notification or workflowUse structured incident routing for operations teams

CloudWatch Logs Insights patterns

fields @timestamp, @message
| filter @message like /ERROR|Exception|Timeout/
| sort @timestamp desc
| limit 50
fields @timestamp, @logStream, status, latency
| filter status >= 500
| stats count(*) as errors, avg(latency) as avgLatency by bin(5m)
| sort bin(5m) desc

Event-driven remediation pattern

Event sourceMatch withTarget examplesUse case
EC2 instance state changeEventBridge ruleLambda, Systems Manager Automation, SNSReact to stopped/terminated instances
AWS Health eventEventBridge ruleSNS, Incident Manager, ticket workflowNotify on account-specific AWS events
CloudTrail API eventEventBridge ruleLambda, Step Functions, SNSDetect high-risk API calls quickly
Scheduled eventEventBridge schedule/ruleSystems Manager Automation, LambdaRun maintenance tasks
Config compliance changeConfig rule/EventBridgeAutomation, SNSRemediate noncompliant resources

Automation, provisioning, and change management

CloudFormation operations reference

NeedCloudFormation featureExam note
Preview stack changesChange setSafer than direct update for production
Detect manual changesDrift detectionIdentifies resources that no longer match template where supported
Protect critical resource from replacement/deletionDeletionPolicy, UpdateReplacePolicy, stack policyUse Retain or Snapshot where appropriate
Reuse common templatesNested stacks/modulesGood for standardized patterns
Deploy to multiple accounts/RegionsStackSetsFits organization-scale rollout
Pass values between stacksOutputs and exportsAvoid hard-coded IDs
Create resources conditionallyConditionsUseful for environment-specific resources
Bootstrap EC2 on createUser data, cfn-init, cfn-signalSignals help CloudFormation wait for successful configuration
Roll back failed updateAutomatic rollback or continue update rollbackKnow how to recover stacks stuck during failed updates
Manage IAM resourcesCapabilities acknowledgmentIAM creation often requires explicit deployment capability
aws cloudformation validate-template \
  --template-body file://template.yaml

aws cloudformation deploy \
  --template-file template.yaml \
  --stack-name app-prod \
  --capabilities CAPABILITY_NAMED_IAM

aws cloudformation detect-stack-drift \
  --stack-name app-prod

Systems Manager capability map

CapabilityBest forRequires/depends on
Fleet ManagerInventory and manage nodesManaged instances
Session ManagerShell access without inbound portsSSM Agent, IAM, endpoint/internet connectivity
Run CommandExecute commands at scaleManaged instance role and target selection
State ManagerKeep configuration in desired stateAssociations and documents
Patch ManagerPatch baselines and complianceMaintenance windows optional but common
AutomationMulti-step runbooksIAM service role/permissions
DistributorInstall software packagesPackage definitions
Parameter StoreApp configuration and secure stringsKMS for encrypted secure strings
InventoryCollect software/config metadataSSM Agent and association
Maintenance WindowsScheduled operational tasksRegistered targets and tasks
OpsCenterTrack operational issuesIntegrates with alarms/events
Change ManagerControlled change workflowsApproval and change templates
aws ssm send-command \
  --document-name "AWS-RunShellScript" \
  --targets "Key=tag:Role,Values=web" \
  --parameters commands='["uptime","df -h"]'

Deployment choices

ScenarioPreferWhy
Deploy new Lambda version graduallyCodeDeploy with Lambda deployment configSupports traffic shifting and rollback
Deploy app to EC2 fleetCodeDeployLifecycle hooks and deployment groups
Replace EC2 instances using new launch templateAuto Scaling instance refreshOperationally simple fleet replacement
Manage immutable infrastructureCloudFormation + AMI/launch template + Auto ScalingReproducible state
Blue/green container deploymentECS deployment controller/CodeDeploy depending setupSafer traffic shifting
Manual emergency config changeSystems Manager Automation/Run CommandAuditable and repeatable
Infrastructure resource updateCloudFormation change setAvoid console drift
Complex orchestration across servicesStep Functions or Systems Manager AutomationChoose based on app workflow vs ops runbook

Compute, scaling, and load balancing

EC2 operational troubleshooting

SymptomCheck firstLikely direction
Instance unreachableSecurity group, NACL, route table, public/private IP, SSM statusSeparate network path problem from OS problem
System status check failedAWS host/network issueStop/start, recover, or allow AWS remediation depending scenario
Instance status check failedGuest OS/app issueCheck boot logs, CPU, disk, networking config
User data did not workCloud-init logs, script syntax, IAM role, network accessUser data normally runs at first boot unless configured otherwise
Cannot access S3 from private subnetRoute/NAT or S3 VPC endpoint policyPrefer gateway endpoint for private S3 access where appropriate
App lost AWS permissionsInstance profile, role policy, SCP/permission boundary, STS credentialsTemporary credentials come from role metadata
Memory/disk alarm missingCloudWatch Agent/custom metricsDefault EC2 metrics do not include all OS-level metrics
Replacement instance not configuredAMI, launch template, user data, SSM State ManagerAvoid snowflake instances

Auto Scaling decisions

NeedFeatureNotes
Maintain fixed capacityDesired/min/max capacityHealth checks replace failed instances
Scale around target metricTarget tracking policyCommon for CPU, request count, custom utilization metric
Scale by thresholds/stepsStep scalingUseful when response should vary by severity
Scale on scheduleScheduled scalingGood for predictable business hours
Prepare for future demandPredictive scalingUse when historical patterns are reliable
Let instances finish work before terminationLifecycle hooksPair with Lambda/SNS/SQS/Systems Manager
Use load balancer healthELB health checks in Auto ScalingReplaces instances failing app-level checks
Safely roll new launch templateInstance refreshCombine with health checks and warmup
Keep scale-in from killing special nodeInstance protectionUseful for stateful/critical instances, but avoid permanent snowflakes

Load balancer selection

Load balancerChoose forKey featuresAvoid when…
Application Load BalancerHTTP/HTTPS appsHost/path routing, redirects, header rules, WebSocket, target groupsNeed static IP at L4
Network Load BalancerTCP/UDP/TLS, high performance, static IP needsLow latency, source IP preservation patterns, TLS passthrough/terminationNeed advanced HTTP routing
Gateway Load BalancerThird-party virtual appliancesTransparent inspection with appliancesNormal web app load balancing
Classic Load BalancerLegacy workloadsOlder EC2-era optionNew architectures should usually choose ALB/NLB

ALB/NLB troubleshooting

ProblemCheck
Targets unhealthyTarget security group, health check path/port/protocol, app listener, NACL, target response code
502/503 errorsTarget availability, listener rules, target group health, backend timeouts
Client IP handlingALB uses headers; NLB can preserve source IP in supported patterns
TLS issueCertificate in ACM/IAM, listener protocol, SNI, security policy
Sticky sessions requiredALB target group stickiness or app-level session design
Slow scale-in connection dropsDeregistration delay and app graceful shutdown

Storage and database operations

Amazon S3 operations

NeedFeatureExam note
Block public exposureS3 Block Public Access + bucket policy reviewAccount-level and bucket-level controls matter
Audit object-level API accessCloudTrail data eventsManagement events alone do not show every object operation
Monitor bucket complianceAWS Config rulesGood for encryption, public access, versioning checks
Recover deleted/overwritten objectsVersioningLifecycle can manage old versions
Replicate objectsSame-Region or Cross-Region ReplicationVersioning is required for replication
Enforce encryptionDefault encryption and bucket policyKMS permissions must allow use when SSE-KMS is selected
Archive or tier objectsLifecycle policiesAlign transitions with access pattern
Prevent deletion/tamperingS3 Object Lock where configuredUnderstand governance/compliance retention behavior at concept level
Query object metadata/inventoryS3 Inventory/AthenaUseful for large-scale reporting
Protect origin contentCloudFront origin access control/origin access identity patternAvoid public bucket origins when private delivery is required

Block, file, and shared storage

ServiceChoose when…Operations focus
EBSBlock storage for one EC2 instance or supported clustered use caseSnapshots, encryption, volume type/performance, attachment, resizing
EFSShared Linux NFS file systemMount targets, security groups, access points, lifecycle policies
FSx for Windows File ServerManaged Windows SMB file sharesAD integration, backups, Windows workloads
FSx for LustreHigh-performance file system for compute workloadsS3 integration patterns, throughput-heavy jobs
Instance storeTemporary high-performance local storageData is ephemeral; do not use for durable state
S3Object storageEvent notifications, lifecycle, replication, access policies

Database operations

NeedRDS/Aurora featureKey distinction
High availability in a RegionMulti-AZ deploymentHA/failover, not read scaling by itself
Read scalingRead replicas/Aurora replicasCan also support some DR patterns
Point-in-time restoreAutomated backupsRestore creates a new DB resource
Manual long-term recovery pointDB snapshotOperationally controlled backup point
Reduce connection stormsRDS ProxyEspecially useful with spiky/serverless app connections
Diagnose DB loadPerformance Insights, Enhanced Monitoring, CloudWatchChoose based on query/database vs OS-level view
Change engine settingsParameter groupSome changes require reboot depending setting
Upgrade safelySnapshot, test, maintenance window, blue/green where availableAvoid untested production upgrades
Encrypt databaseKMS-backed encryption at creation/restore as supportedPlan key permissions and snapshot sharing behavior
NeedDynamoDB featureKey distinction
Automatic capacity adjustmentAuto scaling or on-demand capacity modeChoose based on predictability
Recover table to prior timePoint-in-time recoveryOperational recovery, not analytics
Global low-latency writes/readsGlobal tablesMulti-Region active-active pattern
React to item changesDynamoDB StreamsFeed Lambda/consumers
Expire old itemsTTLDeletion is asynchronous
Protect accidental deletionBackups, PITR, IAM controlsCloudFormation deletion policy may also matter

Networking and content delivery

VPC connectivity decision table

NeedChooseHigh-yield notes
Public IPv4 internet access for instancePublic subnet route to internet gateway + public IPSecurity group/NACL must allow traffic
Private subnet outbound IPv4 internetNAT gateway or NAT instanceNAT does not allow unsolicited inbound from internet
Private IPv6 outbound internetEgress-only internet gatewayIPv6 does not use NAT in the same way
Private access to S3/DynamoDBGateway VPC endpointRoute table association and endpoint policy matter
Private access to many AWS servicesInterface VPC endpointENI-based, security groups, private DNS option
Connect VPCs at scaleTransit GatewayHub-and-spoke routing; route tables still matter
Simple direct VPC-to-VPC connectivityVPC peeringNon-transitive; CIDR overlap is a blocker
Hybrid encrypted connectionSite-to-Site VPNFaster to establish than physical private connectivity
Dedicated private networkAWS Direct ConnectOften paired with VPN for encryption/backup design
DNS routing and failoverRoute 53Health checks and routing policies are central
Global static entry and accelerationAWS Global AcceleratorRoutes to healthy regional endpoints over AWS network
Cache static/dynamic content at edgeCloudFrontCache behavior, origin, TTL, invalidation, TLS

Security group vs NACL

ControlSecurity groupNetwork ACL
ScopeElastic network interface/resourceSubnet
StateStatefulStateless
RulesAllow rules onlyAllow and deny rules
EvaluationAll applicable rulesOrdered rule evaluation
Common useInstance/app firewallSubnet guardrail or explicit deny
Exam trapReturn traffic automatically allowedReturn traffic must be explicitly allowed

Route 53 routing policies

PolicyUse when…
SimpleSingle basic answer
WeightedSplit traffic by assigned proportions
Latency-basedSend users to lowest-latency Region
FailoverActive/passive with health checks
GeolocationRoute by user geographic location
GeoproximityRoute by location with optional bias
Multivalue answerReturn multiple healthy records
AliasPoint DNS to supported AWS resources without hard-coding IPs

VPC troubleshooting quick path

    flowchart TD
	    A[Connectivity failure] --> B{DNS resolves?}
	    B -- No --> C[Check Route 53/private hosted zone/resolver/DHCP options]
	    B -- Yes --> D{Route exists?}
	    D -- No --> E[Check route table, TGW, peering, IGW, NAT, endpoint]
	    D -- Yes --> F{Firewall allows?}
	    F -- No --> G[Check security groups and NACLs both directions]
	    F -- Yes --> H{Target healthy/listening?}
	    H -- No --> I[Check OS firewall, app port, ELB health check, instance status]
	    H -- Yes --> J[Check asymmetric routing, TLS, proxy, endpoint policy, IAM]

Security, identity, and compliance operations

IAM policy evaluation reference

ConceptExam meaning
Default denyNo permission unless allowed
Explicit denyOverrides any allow
Identity-based policyAttached to users, groups, or roles
Resource-based policyAttached to resource, such as S3 bucket, KMS key, Lambda function
Permissions boundaryMaximum permissions an identity can receive
SCPMaximum permissions for accounts/OUs in AWS Organizations; does not grant access
Session policyFurther restricts temporary session permissions
RoleAssumed for temporary credentials; preferred for AWS services and cross-account access
Instance profileDelivers IAM role credentials to EC2
Trust policyDefines who can assume a role
Access AnalyzerDetects unintended external access and validates policies
aws iam simulate-principal-policy \
  --policy-source-arn arn:aws:iam::111122223333:role/AppRole \
  --action-names s3:GetObject \
  --resource-arns arn:aws:s3:::example-bucket/example-key

Security and governance service selection

NeedServiceNotes
API audit logsCloudTrailEnable organization trail for multi-account visibility where appropriate
Resource complianceAWS ConfigManaged/custom rules, aggregators, conformance packs
Threat detectionGuardDutyFindings from account/workload signals
Security findings aggregationSecurity HubConsolidates findings and standards checks
Vulnerability scanningAmazon InspectorEC2, container, and Lambda vulnerability coverage depending configuration
S3 sensitive data discoveryMacieData classification focus
DDoS protectionAWS ShieldStandard is automatic; Advanced adds more protections/features
Web app filteringAWS WAFRules for HTTP/S traffic at ALB, CloudFront, API Gateway, etc.
Certificate managementAWS Certificate ManagerPublic/private cert lifecycle for integrated services
Secrets rotationSecrets ManagerRotation workflows and database integrations
Central account guardrailsAWS Organizations SCPsGuardrails only; IAM still grants actual permissions
Key managementKMSKey policy, IAM, grants, rotation settings, auditing

KMS operational distinctions

TopicRemember
Key policyPrimary control for KMS key access
IAM policyCan allow KMS actions only if key policy permits/delegates
GrantsCommon for AWS services needing temporary/delegated key use
AWS managed keyManaged by AWS for a service/account
Customer managed keyMore control over policy, rotation settings, auditing, deletion scheduling
Multi-Region keyUseful for client-side or service patterns needing related keys across Regions
Encryption contextAdditional authenticated data used by some integrations/policies
S3 SSE-KMS failureCheck both S3 permission and KMS key permission

Reliability, backup, and disaster recovery

Reliability design choices

RequirementPreferWhy
Survive instance failureAuto Scaling across Availability ZonesReplaces unhealthy capacity
Survive AZ failure for app tierMulti-AZ subnets + load balancer + Auto ScalingDistributes traffic and capacity
Survive DB instance/AZ failureRDS Multi-AZ or Aurora HA designManaged failover capability
Recover accidental deleteBackups, snapshots, versioning, PITRHA is not backup
Regional disaster recoveryCross-Region backups/replication + Route 53/Global Accelerator patternsDR requires runbooks and testing
Reduce noisy alertsComposite alarms and dependency-aware runbooksAvoid paging on downstream symptoms only
Validate resilienceGame days/failure testing where appropriateKnow rollback and blast radius
Standardize recoverySystems Manager Automation runbooksRepeatable operations beat manual steps

DR pattern reference

PatternCost/complexityOperational idea
Backup and restoreLowestRestore from backups when needed
Pilot lightLow/mediumCore components replicated; scale out during event
Warm standbyMedium/highScaled-down full environment already running
Active-activeHighestMultiple Regions actively serve traffic

Backup decision points

NeedUse
Centralized policy across supported servicesAWS Backup plans
EC2 volume recoveryEBS snapshots or AWS Backup
RDS database recoveryAutomated backups, snapshots, AWS Backup
S3 object recoveryVersioning, replication, Object Lock where configured
Cross-account backup isolationAWS Backup cross-account strategy
Cross-Region recoveryCross-Region backup/copy/replication
Accidental stack deletion protectionCloudFormation termination protection and deletion policies

Cost, performance, and operational hygiene

GoalTools/actionsExam note
Detect budget overrunAWS Budgets, Cost Explorer, cost anomaly detectionBudgets notify/control; Cost Explorer analyzes
Rightsize computeAWS Compute Optimizer, CloudWatch metricsNeeds enough metric history to make useful recommendations
Reduce idle resourcesFind unattached EBS volumes, idle load balancers, old snapshots, unused Elastic IPsTag ownership and lifecycle
Optimize S3 costLifecycle policies, storage class analysis, inventoryMatch storage class to access and retrieval needs
Reduce NAT dependencyVPC endpoints for supported AWS servicesOften improves private connectivity posture too
Control log costLog retention, filters, export/archive strategyInfinite retention can become expensive
Standardize tagsTag policies, Config rules, cost allocation tagsTags support cost, automation, and ownership
Improve app latencyALB/NLB choice, CloudFront, caching, database tuningDo not solve all latency with larger instances
Improve database performancePerformance Insights, read replicas, indexes/query tuning, cachingMulti-AZ is HA, not a read-scaling feature
Scale queues/workersSQS metrics + Auto Scaling/custom metricsScale on backlog per worker or latency-oriented metric

High-yield traps and distinctions

TrapCorrect exam reasoning
“Need to know who changed it” -> choose CloudWatchChoose CloudTrail for API audit; Config for configuration timeline
“Need to keep resource compliant” -> use CloudTrail onlyUse AWS Config rules or Systems Manager State Manager depending resource/config
“Private subnet needs S3 access” -> NAT is always bestGateway VPC endpoint is usually the private AWS-native path for S3
“Multi-AZ means backup”Multi-AZ is availability; backups/versioning/PITR handle recovery from bad changes
“Read replica means automatic HA failover for primary”Read replicas are primarily read scaling/DR; Multi-AZ is the HA answer for RDS primary failover
“SCP grants admin access”SCPs set maximum permissions; IAM/resource policies still grant
“Security group blocks with deny rule”Security groups allow only; NACLs can explicitly deny
“User data is configuration management”User data bootstraps; Systems Manager/CloudFormation maintain repeatable operations
“Changing launch template updates running instances”Existing instances remain until replaced, refreshed, or relaunched
“CloudFront replaces Route 53”CloudFront caches/distributes content; Route 53 resolves DNS and routes queries
“CloudWatch default EC2 metrics include memory”Memory/disk typically require CloudWatch Agent/custom metrics
“CloudTrail data events are always automatically logged”Know the distinction between management events and optional data-event logging
“KMS IAM allow is enough”Key policy, IAM policy, grants, and service integration all matter
“Public subnet equals internet reachable”Needs route, public address, firewall rules, and listening service
“NACL statefulness works like security groups”NACLs are stateless; return path rules matter

Rapid scenario drill table

If the scenario says…Fast answer direction
“No SSH allowed, but admins need shell access”Systems Manager Session Manager
“Run a command on all instances tagged Environment=Prod”Systems Manager Run Command
“Ensure a package remains installed”Systems Manager State Manager
“Patch instances during a defined window”Patch Manager + Maintenance Windows
“Preview infrastructure changes before update”CloudFormation change set
“Manual console changes caused drift”CloudFormation drift detection; remediate via template
“Notify on EC2 state changes”EventBridge rule
“Alarm when error rate exceeds threshold”CloudWatch metric/math alarm
“Search last hour of app errors”CloudWatch Logs Insights
“Count log pattern as metric”CloudWatch Logs metric filter
“Who opened port 22?”CloudTrail, then Config for current/history
“Block public S3 buckets across accounts”S3 Block Public Access, Config/SCP guardrails as appropriate
“Analyze whether bucket policy allows outside access”IAM Access Analyzer
“Database failover within Region”RDS Multi-AZ/Aurora HA
“Scale reads from database”Read replicas or cache layer
“Static website/global caching”CloudFront in front of S3 or origin
“Private service access from VPC”VPC endpoint
“Hybrid connection over internet”Site-to-Site VPN
“Central hub for many VPCs”Transit Gateway
“Filter malicious HTTP requests”AWS WAF
“Aggregate security findings”Security Hub
“Detect suspicious AWS account activity”GuardDuty
“Find sensitive data in S3”Macie
“Central backup policy and reporting”AWS Backup
“Cost forecast and historical spend”Cost Explorer
“Alert before budget is exceeded”AWS Budgets

Final review checklist

Before sitting for SOA-C03, be able to:

  • Pick between CloudWatch, CloudTrail, AWS Config, EventBridge, and Systems Manager without hesitation.
  • Troubleshoot EC2, Auto Scaling, ELB, Route 53, and VPC connectivity from symptoms.
  • Explain security group vs NACL, NAT vs VPC endpoint, Multi-AZ vs backup, and SCP vs IAM policy.
  • Choose operational automation: Run Command, State Manager, Patch Manager, Automation, CloudFormation, CodeDeploy, or instance refresh.
  • Map storage/database recovery needs to S3 versioning, EBS snapshots, RDS backups/PITR, DynamoDB PITR, and AWS Backup.
  • Recognize least-privilege, encryption, logging, tagging, and repeatable infrastructure patterns.

Next step: work through timed SOA-C03-style scenarios and force yourself to name the AWS service, the operational reason, and the first troubleshooting check before reading the explanation.

Browse Certification Practice Tests by Exam Family