AI-200 — Microsoft Azure AI Cloud Developer Associate Quick Reference

Compact AI-200 quick reference for Azure AI service selection, RAG patterns, security, deployment, and troubleshooting.

How to Use This Quick Reference

This independent Quick Reference is for candidates preparing for the Microsoft Azure AI Cloud Developer Associate (AI-200) exam. Use it to review decision points, service selection, implementation patterns, and common traps for Azure AI development scenarios.

Focus less on memorizing product names and more on answering: Which Azure AI service fits the requirement, how is it secured, how is it deployed, and how do you evaluate and troubleshoot it?

High-Yield Exam Map

AreaWhat to know for AI-200-style scenarios
Azure AI service selectionChoose between Azure OpenAI, Azure AI Search, Azure AI Document Intelligence, Azure AI Language, Azure AI Vision, Azure AI Speech, Translator, Content Safety, Azure Machine Learning, and app hosting services.
Generative AI developmentChat completions, embeddings, model deployments, prompt structure, tools/function calling, token management, response grounding, and evaluation.
Retrieval-augmented generationChunking, embeddings, vector indexes, hybrid search, semantic ranking, citations, access filtering, freshness, and hallucination reduction.
Knowledge miningAzure AI Search indexes, indexers, data sources, skillsets, enrichment pipelines, custom skills, and semantic/vector search.
Natural language, speech, vision, documentsSelect prebuilt vs custom models; distinguish OCR, form extraction, image analysis, transcription, translation, classification, and entity extraction.
Security and governanceMicrosoft Entra ID, managed identities, keys, Key Vault, RBAC, private networking, content filters, responsible AI controls, logging, and data protection.
Deployment and operationsApp Service, Azure Functions, Container Apps, AKS, API Management, queues, monitoring, retries, throttling, testing, and CI/CD.

Azure AI Service Selection Matrix

RequirementPreferWhyWatch for
Build a chat, summarization, reasoning, or code-assist featureAzure OpenAI Service or model deployments through Azure AI development toolingManaged access to large language models with Azure security and deployment controlsThe model name is not enough; apps call a deployment. Region and model availability matter.
Build, test, evaluate, and manage generative AI appsAzure AI Foundry toolingProject-based development, prompt workflows, evaluations, deployments, and model catalog workflowsDo not confuse design-time project tooling with the runtime app architecture.
Ground an LLM on enterprise documentsAzure AI Search + embeddings + Azure OpenAISupports keyword, vector, hybrid, semantic ranking, metadata filters, and citationsRetrieval does not guarantee correctness; still evaluate groundedness and safety.
Search structured and unstructured enterprise contentAzure AI SearchIndexes documents, supports filters, scoring, semantic ranking, vector search, and enrichmentIndex schema, analyzer choice, vector dimensions, and metadata fields are exam-relevant.
Extract fields from invoices, receipts, IDs, tax forms, or custom formsAzure AI Document IntelligencePrebuilt and custom document extraction modelsUse Document Intelligence for field extraction, not generic OCR-only scenarios.
OCR text from images or simple documentsAzure AI Vision Read/OCR or Document IntelligenceVision handles image OCR; Document Intelligence handles document-centric extractionIf the scenario needs key-value pairs/tables/forms, choose Document Intelligence.
Analyze images for captions, tags, objects, or visual featuresAzure AI VisionPrebuilt image analysis capabilitiesDo not choose Document Intelligence for general image tagging.
Classify images with custom labelsCustom Vision / custom image model workflowTrain image classification or object detection from labeled imagesUse only when prebuilt Vision features are insufficient.
Detect language, sentiment, key phrases, entities, or PIIAzure AI LanguagePrebuilt NLP APIsUse custom Language models when domain-specific labels, intents, or entities are needed.
Build intent recognition for a chatbotConversational Language UnderstandingMaps user utterances to intents and entitiesCLU identifies intent; it does not automatically complete business workflows.
Create FAQ-style question answering over curated contentCustom question answeringBest for controlled knowledge bases and FAQ-style responsesFor broad document retrieval plus generation, prefer RAG with Azure AI Search and an LLM.
Translate text between languagesTranslatorPurpose-built machine translationDo not use speech translation unless audio is involved.
Transcribe or synthesize speechAzure AI SpeechSpeech-to-text, text-to-speech, speech translation, custom speech scenariosBatch vs real-time and custom model requirements are common decision points.
Detect harmful, unsafe, or policy-violating contentAzure AI Content Safety plus model content filtersSafety classification for text/images and layered protection for generative AI appsSafety filters reduce risk; they are not a substitute for app authorization or validation.
Train, register, deploy, and monitor custom ML modelsAzure Machine LearningFull ML lifecycle for custom models, pipelines, endpoints, and MLOpsDo not choose Azure ML when a prebuilt Azure AI service satisfies the requirement.
Expose AI functionality through an APIApp Service, Azure Functions, Container Apps, AKS + API ManagementHosts app logic and protects/standardizes APIsThe AI service is not usually the entire application boundary.
Trigger AI processing from uploaded files/eventsEvent Grid, Service Bus, Storage Queue, Azure FunctionsEvent-driven ingestion and asynchronous processingUse queues for buffering, retries, and decoupling long-running AI tasks.

Core Azure AI Terms

TermMeaningExam distinction
Azure AI services resourceAzure resource used to access one or more cognitive services APIsMulti-service resources simplify management but not every scenario uses one shared endpoint.
Azure OpenAI resourceAzure resource for deploying and calling OpenAI models through AzureYou deploy a model before an app can call it.
ModelThe base AI capability, such as a chat model or embedding modelModel availability is not the same as deployment availability.
DeploymentNamed runtime instance of a model in an Azure OpenAI resourceIn many SDK calls, the model parameter is the deployment name.
EndpointNetwork address used by applications to call a serviceMay be public, restricted by firewall, or private through Private Link.
KeyShared secret for API accessSimpler but weaker operational model than managed identity. Store in Key Vault when used.
Managed identityMicrosoft Entra identity assigned to an Azure workloadPreferred for Azure-hosted apps calling Azure services without secrets.
RBACRole-based access control through Microsoft Entra IDSeparate management-plane permissions from data-plane permissions.
IndexSearchable structure in Azure AI SearchRequires a schema, fields, analyzers, and optionally vector fields.
IndexerCrawler that loads data from a supported source into an indexRuns on schedule or demand; does not run at query time.
SkillsetEnrichment pipeline for Azure AI Search indexingApplies OCR, extraction, translation, custom skills, or projections during indexing.
EmbeddingNumeric vector representation of text/imagesQuery and document embeddings must be generated with compatible models and dimensions.
ChunkSegment of a document indexed for retrievalBad chunking causes weak grounding even with a strong model.
Semantic rankingLanguage-aware ranking layer in Azure AI SearchOften combined with keyword/vector retrieval for better relevance.
Content filterSafety control applied to model inputs/outputsNot an authorization system and not a full business policy engine.

Generative AI Development Reference

Chat Completion Anatomy

ComponentPurposeCommon trap
System instructionSets assistant behavior, constraints, tone, and task rulesIt is not a security boundary. Always validate inputs, outputs, and tool calls.
User messageEnd-user requestUser content may contain prompt injection attempts.
Assistant messagePrior model responseLong histories consume tokens and may preserve bad context.
Tool/function definitionDescribes callable app functionsThe model suggests calls; your code authorizes and executes them.
Retrieved contextExternal data inserted into the promptMust be relevant, access-controlled, and cited when required.
Response formatControls structured output, such as JSONValidate schema after generation; do not assume perfect formatting.
Temperature/top-pControls randomnessLower values usually suit extraction, classification, and deterministic business tasks.
Max tokensCaps response lengthToo low truncates answers; too high can increase latency and cost.

Azure OpenAI SDK Pattern

Use Microsoft Entra ID or managed identity where possible for production workloads. API keys are common in simple examples but should be protected.

from azure.identity import DefaultAzureCredential, get_bearer_token_provider
from openai import AzureOpenAI

token_provider = get_bearer_token_provider(
    DefaultAzureCredential(),
    "https://cognitiveservices.azure.com/.default"
)

client = AzureOpenAI(
    azure_endpoint="https://<resource-name>.openai.azure.com/",
    azure_ad_token_provider=token_provider,
    api_version="<api-version>"
)

response = client.chat.completions.create(
    model="<deployment-name>",
    messages=[
        {"role": "system", "content": "Answer using the provided policy excerpt only."},
        {"role": "user", "content": "What is the refund window?"}
    ],
    temperature=0.2
)

print(response.choices[0].message.content)

High-yield detail: in Azure OpenAI calls, model="<deployment-name>" commonly refers to the Azure deployment name, not just the base model family.

Model Choice Decision Points

NeedPreferNotes
General chat, summarization, reasoning, extractionChat/completion modelChoose based on required quality, latency, cost, context length, and region availability.
Enterprise RAGChat model + embedding model + Azure AI SearchThe chat model generates; the embedding model retrieves.
Similarity searchEmbedding modelStore embeddings in a vector index; do not ask embeddings to generate answers.
Deterministic classification/extractionLower temperature, schema validation, possibly Azure AI Language or Document IntelligenceFor standard NLP/document tasks, prebuilt services may be more reliable and simpler.
Multimodal reasoningModel/service that supports the required input typeVerify whether the scenario needs image, text, audio, or document-native processing.
High-volume automationSmaller/faster model where acceptable, caching, batching, queueingAvoid using the largest model by default.
Regulated or sensitive workflowPrivate networking, managed identity, logging strategy, human review, content safetySecurity and governance often decide the architecture.

Retrieval-Augmented Generation Reference

RAG Flow

    flowchart LR
	    A[Source documents] --> B[Extract text and metadata]
	    B --> C[Chunk documents]
	    C --> D[Generate embeddings]
	    D --> E[Index in Azure AI Search]
	    U[User question] --> V[Embed query]
	    V --> W[Vector / hybrid retrieval]
	    W --> X[Rank, filter, and trim]
	    X --> Y[Prompt with retrieved context]
	    Y --> Z[Generate grounded answer with citations]
	    Z --> Q[Evaluate, log, and monitor]

RAG Design Matrix

Design choiceUse whenExam cuesTraps
Keyword searchExact terms, IDs, names, codes, or structured phrases matter“Find documents containing…”Poor semantic recall for paraphrased questions.
Vector searchUsers ask semantically similar but differently worded questions“Natural language questions over documents”Vector dimensions must match the embedding model.
Hybrid searchNeed both exact matching and semantic recall“Best relevance over enterprise content”Requires tuning scoring, filters, and ranking.
Semantic rankingNeed improved natural-language relevance and captions/answers“Improve result quality without retraining”It ranks retrieved candidates; it does not replace indexing.
Metadata filteringNeed access control, departments, dates, regions, document types“Only show documents user can access”Filter fields must exist and be populated in the index.
Security trimmingResults must respect user permissions“User-specific document access”Do not rely on the LLM to hide unauthorized text after retrieval.
Chunk overlapConcepts span boundaries between chunks“Answers miss context at page breaks”Too much overlap increases index size and duplicate retrieval.
CitationsUsers need traceability“Answer with sources”Citations require source metadata captured during ingestion.
FreshnessData changes often“New documents must appear quickly”Scheduled indexers may not meet near-real-time needs; consider push/event ingestion.
Human reviewHigh-impact or risky outputs“Approval required before action”Content filters alone may be insufficient.

Minimal Search Index Fields for RAG

FieldPurposeSearch configuration
idStable unique keyKey field
contentChunk text passed to the modelSearchable
contentVectorEmbedding for vector searchVector field with matching dimensions
titleHuman-friendly source labelSearchable/filterable as needed
sourceUriCitation link or storage referenceRetrievable
pageNumber / sectionCitation precisionFilterable/retrievable
lastModifiedFreshness filtering/sortingFilterable/sortable
acl / groupsSecurity trimmingFilterable
documentTypeFilter by policy, manual, contract, etc.Filterable/facetable

Vector/Hybrid Search SDK Shape

from azure.search.documents import SearchClient
from azure.search.documents.models import VectorizedQuery
from azure.identity import DefaultAzureCredential

search_client = SearchClient(
    endpoint="https://<search-service>.search.windows.net",
    index_name="<index-name>",
    credential=DefaultAzureCredential()
)

vector_query = VectorizedQuery(
    vector=query_embedding,
    k_nearest_neighbors=5,
    fields="contentVector"
)

results = search_client.search(
    search_text="refund policy for annual subscriptions",
    vector_queries=[vector_query],
    filter="documentType eq 'policy'",
    select=["title", "content", "sourceUri", "pageNumber"],
    top=5
)

Use this pattern to remember the separation between query embedding, vector retrieval, metadata filtering, and prompt construction.

Azure AI Search and Knowledge Mining

ComponentRoleWhen to useCommon issue
Data sourceConnection to supported content storePull data from Azure Storage, databases, or other supported sourcesPermissions and private networking can block indexers.
IndexerMoves data into an indexScheduled or on-demand indexingIt does not continuously reflect changes unless scheduled or triggered.
SkillsetEnriches content during indexingOCR, entity extraction, key phrases, translation, custom enrichmentSkills run at ingestion time, not at query time.
Custom Web API skillCalls your custom enrichment logicDomain-specific extraction, normalization, classificationMust handle scaling, failures, and expected schema.
Index projectionMaps enriched content into target index structuresParent-child or chunked indexing patternsIncorrect mapping leads to missing fields.
AnalyzerTokenization and text processingLanguage-specific search behavior, stemming, tokenizationAnalyzer choice affects matching and cannot always be casually changed later.
Synonym mapExpands equivalent termsIndustry acronyms, product aliasesSynonyms help keyword search but do not replace semantic/vector search.
Semantic configurationDefines prioritized fields for semantic rankingBetter captions/reranking for natural-language queriesNeeds meaningful title/content fields.
Vector profile/configurationEnables vector searchEmbedding-based retrievalEmbedding dimensions and vector field config must align.

Knowledge Mining vs RAG

ScenarioBetter answer
“Extract entities and key phrases from documents into a searchable index”Azure AI Search skillset with enrichment
“Ask natural-language questions and generate answers from indexed documents”RAG using Azure AI Search plus Azure OpenAI
“Search documents with filters, facets, and relevance scoring”Azure AI Search
“Summarize search results into a conversational response”Azure AI Search retrieval followed by generative model response
“Apply OCR before indexing scanned PDFs”Azure AI Search skillset with OCR, or Document Intelligence depending on extraction needs

Language, Speech, Vision, and Document Services

Azure AI Language

RequirementChooseNotes
Sentiment and opinion miningSentiment analysisIdentifies positive/negative/neutral sentiment and opinions where supported.
Extract names, places, organizations, datesNamed entity recognitionUse custom NER for domain-specific entities.
Detect sensitive personal dataPII detectionCombine with app policy for redaction, storage, and auditing.
Extract important termsKey phrase extractionUseful for tagging and indexing.
Identify languageLanguage detectionOften used before translation or language-specific processing.
Classify text into custom categoriesCustom text classificationRequires labeled examples and training/evaluation.
Extract domain-specific entitiesCustom named entity recognitionUse when prebuilt NER misses business-specific labels.
Detect user intent and entities in conversationsConversational Language UnderstandingGood for bot commands and routing.
FAQ-style answers from curated sourcesCustom question answeringBest for controlled knowledge base scenarios.

Azure AI Speech and Translator

RequirementChooseKey distinction
Convert microphone or audio files to textSpeech-to-textReal-time vs batch transcription matters.
Convert text to spoken audioText-to-speechVoice, language, and style requirements drive selection.
Translate textTranslatorText input/output.
Translate spoken audioSpeech translationAudio input with translation output.
Improve recognition for domain vocabularyCustom SpeechUse when baseline transcription struggles with accents, terms, or environment.
Build voice-enabled appSpeech SDK + app hostThe SDK handles audio interaction; your app handles business logic.

Azure AI Vision and Document Intelligence

RequirementChooseAvoid this mistake
Read text from an imageAzure AI Vision OCR/ReadDo not build a custom model for basic OCR.
Extract fields from formsAzure AI Document IntelligenceOCR alone does not produce structured fields reliably.
Extract tables from documentsDocument IntelligenceTables require document-aware layout extraction.
Use prebuilt invoice/receipt/ID extractionDocument Intelligence prebuilt modelDo not train custom if a prebuilt model satisfies the form type.
Extract from a custom business formDocument Intelligence custom modelNeeds representative labeled samples and evaluation.
Classify document types before extractionDocument classifier / routing patternRoute to the right extraction model.
Generate image tags/captionsAzure AI Vision image analysisDocument Intelligence is document-centric, not image-scene analysis.
Detect custom objects in imagesCustom Vision or custom vision model workflowRequires labeled images and model training.

Prompting, Tools, and Agentic Patterns

PatternUse whenImplementation reminder
Direct promptSimple transformation, drafting, summarization, or classificationKeep instructions explicit and constrain output format.
Few-shot promptNeed consistent style or labelsInclude representative examples; avoid excessive token use.
RAG promptNeed answers grounded in private/current dataRetrieve first, then generate using context and citation instructions.
Tool/function callingThe model needs live data or actionsValidate arguments, authorize user, execute tool in app code, then return result to model.
Planner/agent loopMulti-step tasks with tool useAdd iteration limits, logging, timeout, safety checks, and human approval for risky actions.
Structured outputDownstream system expects JSON or schemaValidate and retry/repair; never blindly trust generated JSON.
Prompt templateReusable prompt with variablesTreat retrieved/user content as data, not instructions.
Guardrail promptReduce unsafe behaviorHelpful but insufficient without content filters, authorization, and validation.

Tool Calling Control Points

StepControl
Tool definitionExpose only required functions and arguments.
User requestAuthenticate user and check authorization before tool execution.
Model-proposed tool callValidate name, arguments, types, ranges, and policy constraints.
Tool executionUse least-privilege identity and handle timeouts/retries.
Tool resultRemove secrets and unnecessary data before sending back to the model.
Final responseCheck safety, correctness, citation, and formatting requirements.

Security, Identity, and Governance

Authentication and Authorization Choices

OptionBest useExam distinction
API keySimple local testing or services that require key-based accessStore in Key Vault; rotate; avoid embedding in code or client apps.
Microsoft Entra IDEnterprise authentication and RBACPreferred for production when supported.
Managed identityAzure-hosted app calling Azure servicesAvoids secrets; assign least-privilege roles.
Service principalCI/CD or non-Azure workload automationProtect credentials; scope permissions tightly.
SAS tokenLimited delegated access to storage objectsTime-bound and permission-scoped; not an identity replacement.
Key VaultSecret, key, and certificate managementApp identity needs permission to retrieve secrets.

RBAC and Access Boundaries

BoundaryWhat it controlsCommon trap
Azure management planeCreate/update/delete resourcesContributor on a resource does not always grant data-plane read/write.
Data planeUse the service endpoint, indexes, models, or documentsRequires service-specific roles or keys.
Search index accessQuery or modify indexes/documentsSeparate query access from index administration.
Storage accessRead/write source documentsIndexers and apps need appropriate storage permissions.
Model deployment accessInvoke deployed modelsUsers/apps may need data-plane permission even if they can view the resource.
Application authorizationWhich user can perform business actionDo not delegate this decision to the model.

Network Isolation Checklist

RequirementControl
Keep traffic off public internet where supportedPrivate Endpoint / Private Link
Restrict public accessDisable or limit public network access and configure firewalls
Allow specific Azure services or networksService firewall rules and network integration
Resolve private endpoints correctlyPrivate DNS zone configuration
Secure app-to-service callsManaged identity plus private endpoint where supported
Protect inbound app APIsAPI Management, authentication, authorization, WAF where appropriate
Log security-relevant eventsAzure Monitor, diagnostics, app logs, and audit trails

Responsible AI and Safety Controls

ConcernPractical control
Harmful contentAzure AI Content Safety, model content filters, blocked categories, review workflow
HallucinationRAG grounding, citations, retrieval evaluation, refusal behavior when context is insufficient
Prompt injectionTreat retrieved/user text as untrusted, separate instructions from data, validate tool calls
Data leakageAccess filtering before retrieval, output redaction, least privilege, private networking
Bias or unfairnessRepresentative test sets, human review, metric tracking, documented limitations
OverrelianceConfidence indicators, citations, escalation paths, user education
Unsafe actionsHuman-in-the-loop approval, allowlisted tools, transaction limits, audit logs
PrivacyMinimize collected data, redact where appropriate, control logs and retention

Deployment Architecture Patterns

PatternUse whenAzure services commonly involved
Synchronous AI APIUser waits for responseApp Service / Container Apps / AKS, Azure OpenAI, Azure AI Search, API Management
Event-driven document ingestionFiles arrive asynchronouslyBlob Storage, Event Grid, Azure Functions, Azure AI Search, Document Intelligence
Long-running batch processingLarge document sets or audio transcriptionQueue/Service Bus, Functions/Container Apps, durable workflow pattern, Storage
Chat application with RAGConversational answers over enterprise dataWeb app, Azure OpenAI, Azure AI Search, storage, identity provider
Bot interfaceTeams/web chat integrationAzure Bot Service, CLU/Language, Azure OpenAI, backend APIs
Custom ML endpointModel trained outside prebuilt AI servicesAzure Machine Learning endpoint, app host, monitoring
Enterprise API facadeStandardized access to AI backendAPI Management, managed identity, rate limiting, logging, backend services
Private enterprise deploymentSensitive data and restricted accessPrivate Endpoints, VNet integration, managed identity, Key Vault, diagnostics

Hosting Service Selection

NeedPreferNotes
Simple web API or web appAzure App ServiceGood default for managed hosting.
Lightweight event handlerAzure FunctionsGood for triggers, ingestion, and glue logic.
Containerized microservice without Kubernetes overheadAzure Container AppsGood for scalable container workloads and background workers.
Full Kubernetes controlAKSUse when orchestration requirements justify complexity.
Workflow orchestration with connectorsLogic AppsGood for integration-heavy business workflows.
Durable stateful orchestrationDurable Functions patternGood for fan-out/fan-in and long-running workflows.

Monitoring, Evaluation, and Optimization

What to Log and Monitor

SignalWhy it matters
Request count, latency, failure rateBasic reliability and user experience
HTTP status codesDiagnose auth, throttling, quota, and service errors
Token usageCost, latency, and prompt optimization
Model deployment usedCompare quality, regressions, and routing decisions
Prompt/template versionReproduce failures and evaluate changes
Retrieval query and document IDsDebug grounding and citation issues
Safety filter outcomesMonitor blocked content and false positives/negatives
Tool calls and resultsAudit actions and diagnose agent behavior
User feedbackBuild evaluation datasets and prioritize fixes
Indexer statusDetect ingestion failures and stale search data

Evaluation Metrics by Scenario

ScenarioEvaluate
RAG answer generationGroundedness, relevance, citation correctness, answer completeness, refusal when context is insufficient
Search retrievalPrecision, recall, top-k relevance, filter correctness, freshness
ClassificationAccuracy, precision/recall, confusion matrix, threshold behavior
ExtractionField-level accuracy, missing fields, table accuracy, format validity
Chat assistantTask success, safety, latency, escalation rate, user satisfaction
Speech transcriptionWord error patterns, domain vocabulary recognition, speaker/audio conditions
Document extractionModel confidence, field accuracy, page/layout handling, exception routing

Optimization Levers

ProblemFirst levers to try
High latencySmaller/faster model, shorter prompts, fewer retrieved chunks, caching, streaming responses, async processing
High costToken reduction, response length limits, model selection, cache repeated answers, batch offline tasks
Poor groundingImprove chunking, add metadata, hybrid search, semantic ranking, better prompt constraints
Poor extractionUse purpose-built service, add examples, validate schema, choose custom model when prebuilt fails
Frequent throttlingBackoff/retry, queue requests, smooth traffic, request capacity planning
Inconsistent outputLower temperature, structured output, validation/retry, clearer instructions
Stale answersMore frequent indexing, event-driven ingestion, freshness filters

Troubleshooting Reference

SymptomLikely causeCheck
401 UnauthorizedMissing/invalid credentialEndpoint, key/token, managed identity configuration
403 ForbiddenAuthenticated but not authorizedRBAC role, data-plane permission, storage/search permissions
404 deployment not foundWrong Azure OpenAI deployment name or endpointResource endpoint, deployment name, region, API version
429 throttlingToo many requests or capacity pressureRetry-after handling, exponential backoff, queueing, traffic smoothing
5xx service errorsTransient platform/backend issueRetry with backoff, circuit breaker, monitor service health
Vector search returns no resultsWrong field, missing vectors, dimension mismatch, bad embeddingsIndex schema, embedding model, vector field config, indexed documents
Answers hallucinateWeak retrieval, prompt allows unsupported claims, no refusal ruleRetrieved chunks, citations, system instruction, evaluation set
Correct document not retrievedChunking/indexing/query mismatchChunk size, metadata filters, hybrid search, analyzers, synonyms
Citations are wrongSource metadata not stored or chunk mapping incorrectsourceUri, page/section fields, projection logic
Indexer failsData source permissions, unsupported file, skill error, mapping issueIndexer execution history and skillset outputs
Private endpoint connection failsDNS or network routing issuePrivate DNS zone, VNet links, firewall, public access setting
Document fields missingPrebuilt model mismatch or custom model undertrainedDocument type, sample quality, confidence scores
CLU predicts wrong intentOverlapping intents or weak utterance examplesTraining data balance, labels, examples, thresholds
Speech transcription poorAudio quality, noise, vocabulary, accent, wrong languageAudio preprocessing, custom speech, language config
Output JSON invalidModel not constrained or schema too complexStructured output, validation, repair/retry logic
Tool call unsafe or incorrectModel over-selected tool or bad argumentsTool allowlist, argument validation, user authorization

Common Exam Traps

  • Choosing Azure Machine Learning when a prebuilt Azure AI service already solves the task.
  • Choosing OCR when the requirement is structured form or table extraction; use Document Intelligence.
  • Treating a system prompt as a security control. It is guidance, not enforcement.
  • Letting the LLM decide whether a user is authorized to see retrieved content. Filter before retrieval or before prompt assembly.
  • Forgetting that Azure OpenAI apps call a deployment name, not just a model family.
  • Assuming embeddings generate answers. Embeddings support similarity search; a generative model writes the answer.
  • Ignoring metadata fields needed for filters, citations, freshness, and access control.
  • Using only vector search when exact IDs, codes, or names are important; consider hybrid search.
  • Assuming indexers run continuously. Understand scheduled, on-demand, and event-driven ingestion patterns.
  • Sending secrets, raw credentials, or excessive retrieved data into prompts.
  • Confusing management-plane RBAC with data-plane permissions.
  • Skipping retries and backoff for throttling and transient service failures.
  • Choosing the largest model automatically instead of balancing quality, latency, and cost.
  • Treating content filters as a full responsible AI program. They are one layer.

Scenario Answer Checklist

When you see an AI-200 scenario, identify:

  1. Input type: text, image, document, audio, structured data, or mixed.
  2. Task type: generate, retrieve, classify, extract, translate, transcribe, moderate, or train.
  3. Data source: static, frequently updated, private, user-specific, or public.
  4. Best service: prebuilt Azure AI service, Azure OpenAI, Azure AI Search, Azure ML, or a combination.
  5. Security model: managed identity, RBAC, Key Vault, private endpoint, access trimming.
  6. Runtime pattern: synchronous API, async queue, batch job, bot, web app, or containerized service.
  7. Quality controls: evaluation set, citations, validation, thresholds, human review.
  8. Operations: logging, monitoring, retries, throttling, cost/latency optimization.
  9. Responsible AI controls: content safety, privacy, fairness, transparency, escalation.

Practical Next Step

Use this Quick Reference to build a one-page service-selection map, then practice scenario questions where you must justify the Azure AI service, security controls, retrieval pattern, and troubleshooting step. Focus especially on RAG design, managed identity/RBAC, Azure AI Search indexing, and choosing prebuilt AI services over custom models when appropriate.

Browse Certification Practice Tests by Exam Family