Exam-use mental model
This independent Quick Reference supports candidates preparing for Google Cloud Professional Machine Learning Engineer - 2026 Guide from Google Cloud, exam code PMLE. Use it to rehearse service selection, ML lifecycle decisions, and common exam traps.
For most PMLE scenarios, choose the answer that best satisfies:
- Business objective first: metric, latency, freshness, cost, risk, explainability.
- Managed Google Cloud service when practical: reduce custom infrastructure unless requirements demand it.
- Reproducible ML lifecycle: version data, code, parameters, artifacts, metrics, and deployments.
- Secure by design: least privilege, private data paths, encryption, auditability.
- Monitor after deployment: prediction quality, drift, skew, latency, errors, and retraining triggers.
flowchart LR
A[Problem framing] --> B[Data ingestion and validation]
B --> C[Feature engineering]
C --> D[Training or tuning]
D --> E[Evaluation and model registry]
E --> F[Deployment: online, batch, or embedded]
F --> G[Monitoring and explainability]
G --> H[Retraining pipeline]
H --> C
High-yield Google Cloud service selection
Core ML and MLOps services
| Requirement | Prefer | Use when | Avoid when |
|---|
| End-to-end managed ML lifecycle | Vertex AI | Need managed training, tuning, model registry, endpoints, pipelines, metadata, monitoring | You only need simple SQL analytics or non-ML batch processing |
| Low-code model building | Vertex AI AutoML | Tabular, image, text, or similar tasks with limited custom architecture needs | Need full model architecture control, custom loss, specialized training loop |
| Custom model training | Vertex AI custom training | Need custom code, framework, containers, distributed training, GPUs/TPUs | SQL-native ML or AutoML meets requirements |
| SQL-native ML in warehouse | BigQuery ML | Data already in BigQuery; analysts use SQL; batch scoring is acceptable | Complex custom deep learning pipeline or low-latency online serving is required |
| Pretrained perception or language API | Google Cloud AI APIs / Vertex AI foundation models | Need fast integration without training from scratch | Need domain-specific model behavior, private fine-tuning, or custom serving logic |
| Pipeline orchestration for ML | Vertex AI Pipelines | Need reproducible ML steps, artifacts, metadata, lineage, scheduled retraining | General non-ML workflow is primary concern |
| General workflow orchestration | Cloud Composer or Workflows | Need broad DAG orchestration across services | Need ML-native lineage, artifacts, experiments, and model metadata |
| Experiment tracking | Vertex AI Experiments / TensorBoard | Need compare runs, parameters, metrics, artifacts | One-off notebook work with no reproducibility requirement |
| Model catalog and promotion | Vertex AI Model Registry | Need approve, version, deploy, rollback models | Model is never reused or governed |
| Online prediction | Vertex AI endpoints | Need managed low-latency serving, scaling, traffic splitting | Scoring can be delayed or done in bulk |
| Batch prediction | Vertex AI batch prediction or BigQuery ML batch scoring | Need score large datasets asynchronously | Interactive user request needs immediate response |
| Containerized custom inference | Vertex AI custom prediction container, Cloud Run, or GKE | Need custom pre/post-processing or nonstandard serving stack | Standard Vertex AI serving container is enough |
Data and analytics services for ML
| Requirement | Prefer | Why it fits |
|---|
| Large analytical warehouse, SQL features, BI, BQML | BigQuery | Serverless analytics, feature extraction, training data assembly, batch scoring |
| Object storage for raw data, images, model artifacts | Cloud Storage | Durable storage for datasets, exports, checkpoints, training packages |
| Stream ingestion | Pub/Sub | Decouples producers and consumers; supports event-driven pipelines |
| Stream/batch transformations | Dataflow | Managed Apache Beam for scalable ETL, windowing, streaming features |
| Existing Spark/Hadoop jobs | Dataproc | Managed Spark/Hadoop when reusing ecosystem code |
| Metadata, governance, discovery | Dataplex / Data Catalog capabilities | Helps classify, discover, and govern data assets |
| Secrets | Secret Manager | Avoid hardcoded credentials in notebooks, containers, or pipelines |
| Container images | Artifact Registry | Version custom training and serving containers |
| CI/CD | Cloud Build plus deployment tooling | Build, test, and promote ML code and containers |
PMLE decision tables
AutoML vs custom training vs BigQuery ML vs pretrained model
| Scenario clue | Best fit | Reasoning |
|---|
| “Fast baseline,” “limited ML expertise,” “standard tabular/image/text problem” | Vertex AI AutoML | Managed feature processing, training, tuning, and evaluation |
| “Custom loss,” “custom neural network,” “special preprocessing,” “research model” | Vertex AI custom training | Full code and framework control |
| “Data is in BigQuery,” “team uses SQL,” “batch predictions,” “no custom serving” | BigQuery ML | Keeps ML close to warehouse data and SQL workflows |
| “Need sentiment/OCR/speech/translation quickly” | Pretrained Google Cloud AI APIs | No training pipeline required |
| “Need private enterprise answers from documents” | Vertex AI foundation model with grounding/RAG | Injects current/private facts without training model from scratch |
| “Need specialized language/style/task adaptation” | Tuning on Vertex AI, when supported | Changes behavior more than prompting, less work than full custom training |
| “Strict control over weights, architecture, training data, serving” | Custom model on Vertex AI | Required when managed abstractions are insufficient |
Online vs batch prediction
| Requirement | Choose | Watch for |
|---|
| User-facing request/response | Online prediction endpoint | Latency, autoscaling, model size, input validation |
| Millions of rows scored overnight | Batch prediction | Throughput, output location, idempotency |
| Scores joined with warehouse tables | BigQuery ML prediction or batch output to BigQuery | SQL governance and reproducibility |
| Event-driven near-real-time scoring | Pub/Sub + Dataflow + online endpoint or streaming architecture | Backpressure, retry behavior, duplicate handling |
| Very low latency with custom serving logic | Cloud Run or GKE may be considered | More operational responsibility than managed endpoint |
| Model embedded on device or edge | Exported model format if supported | Update strategy, device constraints, monitoring limitations |
Pipeline orchestration choice
| Need | Prefer | Exam distinction |
|---|
| ML steps with artifacts, metadata, lineage | Vertex AI Pipelines | PMLE default for reproducible MLOps |
| Airflow DAG already orchestrates enterprise data platform | Cloud Composer | Good for heterogeneous scheduled workflows |
| Simple service-to-service workflow | Workflows | Lightweight orchestration, not ML-specific |
| Pure ETL transform at scale | Dataflow | Data processing engine, not experiment tracker |
| CI/CD build-test-deploy | Cloud Build | Build automation, not training lineage by itself |
Data preparation and feature engineering
Data split patterns
| Data type | Recommended split | Common trap |
|---|
| Independent tabular rows | Random or hash-based split | Non-reproducible random split causing changing metrics |
| Time series | Time-based split: train on past, validate on future | Random split leaks future information |
| User behavior | Split by user/entity when leakage across rows is possible | Same user appears in train and test |
| Image/text duplicates | Deduplicate or group before split | Near-duplicates inflate evaluation |
| Imbalanced classification | Stratified split when appropriate | Minority class disappears from validation/test |
| Streaming data | Hold out later time windows | Offline test set does not match production freshness |
Stable BigQuery split pattern:
SELECT
*,
CASE
WHEN MOD(ABS(FARM_FINGERPRINT(CAST(customer_id AS STRING))), 10) < 8 THEN 'TRAIN'
WHEN MOD(ABS(FARM_FINGERPRINT(CAST(customer_id AS STRING))), 10) = 8 THEN 'VALIDATE'
ELSE 'TEST'
END AS split
FROM `project.dataset.source_table`;
| Transform location | Use when | Risk |
|---|
| BigQuery SQL | Batch features, warehouse-native joins, aggregations | Training-serving skew if online path reimplements logic differently |
| Dataflow / Apache Beam | Streaming features, large-scale ETL, unified batch/stream processing | Operational complexity if simple SQL is enough |
tf.Transform-style pipeline step | Need identical training and serving transforms | More pipeline complexity |
| Model preprocessing layer | Transform must be packaged with model | Can increase serving latency |
| Feature store / online feature serving | Need consistent offline/online features and low-latency lookup | Requires governance around freshness and keys |
| Application code | Simple request formatting | High skew risk if business logic diverges from training |
Data quality checks to rehearse
| Check | Why it matters |
|---|
| Schema validation | Detects missing, renamed, or type-changed fields |
| Range checks | Finds impossible values, unit errors, and outliers |
| Null/missingness tracking | Missingness may be predictive or indicate broken ingestion |
| Label validation | Incorrect labels can cap model performance |
| Class distribution | Prevents misleading accuracy on imbalanced data |
| Train/serve feature parity | Reduces skew between offline training and online prediction |
| Duplicate detection | Prevents leakage and inflated metrics |
| Time-window correctness | Prevents future data from entering features |
| PII/sensitive data classification | Supports least privilege and privacy controls |
Modeling and evaluation reference
Metric selection
| Problem type | Prefer metrics | Use when | Trap |
|---|
| Balanced classification | Accuracy, log loss, AUC | Classes are roughly balanced and error costs similar | Accuracy hides minority-class failure |
| Imbalanced classification | Precision, recall, F1, PR AUC | Fraud, churn, abuse, rare disease, anomaly review queues | ROC AUC can look good while precision is poor |
| Ranking/recommendation | NDCG, MAP, precision@k, recall@k | Top results matter more than all predictions | Optimizing overall accuracy instead of ranked utility |
| Regression | RMSE, MAE, R-squared | Predict continuous values | RMSE over-penalizes large errors; MAE may hide severe outliers |
| Forecasting | MAE, RMSE, MAPE, weighted errors | Time-dependent demand or capacity | Random split and MAPE issues near zero values |
| Clustering | Silhouette, Davies-Bouldin, business validation | No labels available | Treating unsupervised score as proof of business value |
| Generative AI | Groundedness, factuality, safety, task success, human preference | Open-ended outputs | Evaluating only fluency and ignoring hallucination |
For binary classification:
\[
\text{Precision}=\frac{TP}{TP+FP}
\]\[
\text{Recall}=\frac{TP}{TP+FN}
\]\[
F1=2\cdot\frac{\text{Precision}\cdot\text{Recall}}{\text{Precision}+\text{Recall}}
\]\[
\text{Accuracy}=\frac{TP+TN}{TP+TN+FP+FN}
\]
Threshold and error-cost decisions
| Requirement | Decision |
|---|
| False positives are expensive | Increase precision; raise threshold |
| False negatives are expensive | Increase recall; lower threshold |
| Human review capacity is limited | Optimize precision@k or top-k workload |
| Regulatory or customer-impacting decision | Favor explainability, monitoring, audit logs, and human review |
| Need calibrated probabilities | Evaluate calibration, not just ranking |
| Class distribution shifts in production | Monitor prediction distribution and labels when available |
Model family shortcuts
| Model family | Strength | Weakness | Exam clue |
|---|
| Linear/logistic regression | Simple, interpretable, fast | Limited nonlinear patterns | Baseline, explainability |
| Tree-based models / boosted trees | Strong for tabular data, handles nonlinearities | Less ideal for raw images/text | Structured business data |
| Deep neural networks | Flexible, high capacity | Needs more data/tuning | Images, text, complex signals |
| CNNs | Spatial patterns | Image-specific architecture | Vision workloads |
| Transformers | Text, multimodal, generative tasks | Cost, latency, safety evaluation | NLP, LLM, embeddings |
| Matrix factorization / two-tower retrieval | Recommendations and retrieval | Cold-start handling needed | Users/items, candidate generation |
| Time-series models | Temporal structure | Must respect time ordering | Forecasting demand, capacity, traffic |
Generative AI and foundation-model decisions
| Requirement | Prefer | Why |
|---|
| Prototype text generation, summarization, extraction | Vertex AI foundation model prompting | Fastest path; no training data required |
| Need answers grounded in private documents | Retrieval-augmented generation | Keeps facts in external corpus and reduces stale knowledge risk |
| Need enterprise search over documents | Vertex AI Search / grounding-oriented architecture | Managed retrieval and relevance features |
| Need domain-specific style or task behavior | Model tuning, if supported for selected model | Adjusts behavior beyond prompt engineering |
| Need deterministic structured extraction | Prompt with schema, validation, and post-processing | LLM output should still be validated |
| Need safety controls | Safety settings, content filters, allowlists, human review | Do not rely only on prompt wording |
| Need evaluate generated answers | Human evaluation plus automated checks | Fluency is not enough |
Common generative AI traps:
- Using fine-tuning to add frequently changing facts when RAG is more appropriate.
- Ignoring grounding, citation, and hallucination checks.
- Sending sensitive data to prompts without access control and logging review.
- Evaluating only on “good looking” outputs instead of task-specific test sets.
- Forgetting latency and token cost tradeoffs for long prompts and large contexts.
Vertex AI lifecycle checkpoints
| Lifecycle step | What to remember for PMLE |
|---|
| Dataset creation | Validate schema, labels, splits, and access permissions before training |
| Training | Choose AutoML, custom training, BigQuery ML, or foundation model approach based on constraints |
| Hyperparameter tuning | Use when model class is appropriate but performance depends on configuration |
| Experiments | Track parameters, metrics, source version, data version, and artifact location |
| Model Registry | Version and promote models through review stages |
| Deployment | Choose endpoint, batch prediction, or custom serving target based on latency and throughput |
| Traffic splitting | Use for canary, A/B test, or gradual rollout where supported |
| Monitoring | Track skew, drift, prediction distribution, latency, errors, and business KPIs |
| Retraining | Automate with pipeline triggers, validation gates, and rollback plan |
Custom training pattern
Use custom training when you need control over model code, libraries, training loop, hardware, or distributed strategy.
High-yield components:
| Component | Purpose |
|---|
| Training container | Reproducible runtime with dependencies |
| Training service account | Reads training data, writes artifacts, logs metrics |
| Cloud Storage / Artifact Registry | Stores packages, containers, model artifacts |
| Vertex AI custom job | Runs managed training workload |
| Vertex AI hyperparameter tuning | Searches parameter space with managed trials |
| Vertex AI Experiments / Metadata | Tracks run lineage and metrics |
Minimal command-shape recognition:
gcloud ai custom-jobs create \
--region=REGION \
--display-name=JOB_NAME \
--worker-pool-spec=machine-type=MACHINE_TYPE,replica-count=1,container-image-uri=IMAGE_URI
Do not memorize flags as the main skill. For the exam, understand why a custom job is chosen and which service account, data path, artifact path, and region it uses.
BigQuery ML quick patterns
Use BigQuery ML when the training data is already in BigQuery and SQL-native training/scoring satisfies the requirement.
CREATE OR REPLACE MODEL `project.dataset.churn_model`
OPTIONS(
model_type = 'BOOSTED_TREE_CLASSIFIER',
input_label_cols = ['churned']
) AS
SELECT
* EXCEPT(customer_id, split)
FROM `project.dataset.training_features`
WHERE split = 'TRAIN';
SELECT
*
FROM ML.EVALUATE(
MODEL `project.dataset.churn_model`,
(
SELECT * EXCEPT(customer_id, split)
FROM `project.dataset.training_features`
WHERE split = 'TEST'
)
);
SELECT
customer_id,
predicted_churned,
predicted_churned_probs
FROM ML.PREDICT(
MODEL `project.dataset.churn_model`,
(
SELECT * EXCEPT(churned, split)
FROM `project.dataset.scoring_features`
)
);
BigQuery ML exam clues:
| Clue | Interpretation |
|---|
| Analysts prefer SQL | BigQuery ML is likely |
| Model training data is warehouse-native | Avoid unnecessary export |
| Need batch scoring into tables | BigQuery ML or Vertex AI batch prediction |
| Need low-latency endpoint | BigQuery ML alone is usually not the best fit |
| Need complex custom neural architecture | Use Vertex AI custom training instead |
MLOps, CI/CD, and reproducibility
What to version
| Asset | Why it matters |
|---|
| Source code | Reproduce training and serving behavior |
| Container image | Reproduce runtime dependencies |
| Data snapshot or query | Reproduce training set |
| Feature definitions | Prevent train/serve mismatch |
| Hyperparameters | Explain metric differences |
| Metrics and evaluation reports | Compare candidate models |
| Model artifact | Promote, rollback, and audit |
| Pipeline definition | Recreate workflow |
| Service account and IAM changes | Investigate access and security issues |
Deployment patterns
| Pattern | Use when | Watch for |
|---|
| Manual deployment | Prototype only | Not reproducible or auditable |
| CI/CD to staging then production | Production ML service | Add evaluation and approval gates |
| Canary deployment | Validate new model on small traffic share | Monitor error rates and KPIs |
| A/B testing | Compare business impact | Requires experiment design and unbiased assignment |
| Shadow deployment | Observe new model without affecting users | Needs duplicate inference and log analysis |
| Blue/green | Fast rollback between versions | Requires parallel environment readiness |
| Batch replacement | Periodic scoring pipeline | Validate output schema and downstream consumers |
Pipeline validation gates
| Gate | Failure should block |
|---|
| Data schema check | Training with incompatible data |
| Data quality threshold | Training on corrupt or incomplete data |
| Minimum evaluation metric | Promoting weak model |
| Fairness/slice metric regression | Shipping model harmful to a segment |
| Latency/load test | Deploying model that cannot serve traffic |
| Security scan | Deploying vulnerable container or dependency |
| Explainability or review requirement | Releasing opaque high-risk model |
Security, IAM, and governance
Security controls matrix
| Concern | Google Cloud control pattern | PMLE decision point |
|---|
| Least privilege | Use service accounts with minimal roles | Avoid broad Owner/Editor grants |
| Human access | Grant developers only needed Vertex AI, BigQuery, Storage, and service account permissions | Separate human identity from runtime identity |
| Runtime identity | Dedicated service account for training, pipeline, and prediction | Do not run jobs with personal credentials |
| Sensitive data | BigQuery policy tags, row/column-level controls, DLP-style inspection where appropriate | Protect training data and features |
| Network boundary | Private access patterns, VPC controls where required | Avoid public exposure for sensitive workloads |
| Encryption control | Google-managed encryption by default; CMEK where required | Ensure services and artifacts support required key usage |
| Secrets | Secret Manager | Do not bake secrets into images, notebooks, or environment files |
| Auditability | Cloud Audit Logs, Cloud Logging | Needed for regulated or high-risk ML workflows |
| Container supply chain | Artifact Registry, build scanning, pinned dependencies | Reproducible and reviewable deployments |
| Data exfiltration risk | VPC Service Controls where appropriate | Especially relevant for managed service access to sensitive data |
IAM role-shape reminders
| Principal | Needs | Avoid |
|---|
| Data scientist | Submit jobs, read approved datasets, view experiments | Broad project admin |
| Training job service account | Read training data, write model artifacts, write logs | Access to unrelated production data |
| Pipeline service account | Orchestrate pipeline components and pass approved runtime accounts | Ability to modify all IAM |
| Prediction service | Serve model and write logs/metrics | Access to raw training data unless required |
| CI/CD service account | Build containers, push artifacts, deploy approved models | Personal credentials or unrestricted production access |
| Monitoring operator | Read metrics/logs, acknowledge alerts | Ability to alter training data or models unnecessarily |
Monitoring and troubleshooting
Production ML monitoring signals
| Signal | What it detects |
|---|
| Prediction latency | Serving bottlenecks, oversized model, cold starts, downstream delays |
| Error rate | Bad inputs, container failures, dependency issues |
| Input feature distribution | Data drift, schema changes, source system changes |
| Training-serving skew | Different feature logic or data freshness between train and serve |
| Prediction distribution | Collapsed model, threshold issue, unexpected population shift |
| Ground-truth performance | Real model quality after labels arrive |
| Slice metrics | Degradation for specific user, region, product, or demographic segment |
| Resource utilization | Under/over-provisioning, accelerator bottlenecks |
| Business KPI | Whether ML improvement matters operationally |
Troubleshooting runbook
| Symptom | Likely causes | First actions |
|---|
| Custom training job fails immediately | Bad container entrypoint, missing dependency, IAM denial, invalid path | Check logs, image URI, service account permissions, artifact locations |
| Training cannot read data | Runtime service account lacks BigQuery/Storage access | Grant least-privilege read to the training service account |
| Out-of-memory during training | Batch too large, model too large, inefficient input pipeline | Reduce batch size, use larger machine, optimize data loading |
| Training is slow | Input bottleneck, no accelerator use, poor sharding, cross-region data | Co-locate resources, optimize input pipeline, profile workload |
| Great validation, poor production | Leakage, skew, nonrepresentative split, stale features | Rebuild split, compare train/serve features, inspect production distribution |
| Accuracy high but business impact poor | Wrong metric, imbalance, bad threshold | Optimize metric aligned to cost and decision process |
| Endpoint latency high | Model size, inefficient preprocessing, no batching, scaling config | Profile preprocessing/model, use efficient serving container, tune scaling |
| Drift alert fires | Source distribution changed, upstream bug, seasonality | Validate data source, compare slices, retrain only after quality review |
| Pipeline not reproducible | Unversioned data/code/image, nondeterministic split | Pin versions, use stable split, log parameters and artifacts |
| Permission error in pipeline | Wrong runtime account or missing pass-through permission | Identify executing principal and grant minimal required role |
Responsible AI and explainability
| Topic | Exam-ready action |
|---|
| Fairness | Evaluate metrics by relevant slices, not only aggregate score |
| Explainability | Use feature attribution/explanations where supported and meaningful |
| Bias in labels | Inspect label source and sampling process |
| Privacy | Minimize sensitive features and control access to raw data |
| Human oversight | Add review for high-impact automated decisions |
| Model cards / documentation | Record intended use, limitations, metrics, training data summary |
| Safety for generative AI | Evaluate harmful content, hallucination, leakage, and prompt injection |
| Monitoring | Watch for drift and quality regressions after deployment |
| Feedback loops | Avoid model decisions contaminating future labels without controls |
Common trap: “The model has high AUC, so it is ready.” PMLE-style answers often require slice evaluation, threshold selection, explainability, security review, and production monitoring before release.
Architecture patterns to recognize
Managed tabular prediction
| Layer | Typical choice |
|---|
| Data | BigQuery |
| Training | Vertex AI AutoML or BigQuery ML |
| Pipeline | Vertex AI Pipelines |
| Registry | Vertex AI Model Registry |
| Serving | Vertex AI endpoint for online; BigQuery ML or batch prediction for offline |
| Monitoring | Vertex AI monitoring plus business KPI tracking |
Choose this when the task is standard tabular ML and custom architecture is not required.
Streaming fraud or anomaly scoring
| Layer | Typical choice |
|---|
| Ingestion | Pub/Sub |
| Feature computation | Dataflow streaming |
| Storage | BigQuery for analytics; feature store/low-latency store if needed |
| Serving | Vertex AI endpoint or custom low-latency service |
| Monitoring | Latency, error rate, precision/recall after labels arrive |
Key traps: class imbalance, delayed labels, duplicate events, threshold tuning, false positive cost.
Batch forecasting
| Layer | Typical choice |
|---|
| Historical data | BigQuery or Cloud Storage |
| Feature creation | BigQuery SQL, Dataflow, or pipeline component |
| Training | BigQuery ML, AutoML, or custom training depending on complexity |
| Scoring | Scheduled batch prediction |
| Output | BigQuery table for downstream planning |
| Validation | Time-based backtesting |
Key trap: random split leaks future data.
Document-grounded generative AI
| Layer | Typical choice |
|---|
| Document ingestion | Controlled storage and indexing pipeline |
| Retrieval | Managed search/vector retrieval pattern |
| Generation | Vertex AI foundation model |
| Controls | Grounding, citations, safety settings, prompt injection defenses |
| Evaluation | Groundedness, factuality, task completion, human review |
Key trap: fine-tuning a model to memorize private documents when retrieval is the safer, fresher design.
Common PMLE traps
| Trap | Better answer |
|---|
| Choose custom Kubernetes for every ML workload | Prefer Vertex AI managed services unless requirements demand custom orchestration |
| Optimize accuracy on imbalanced data | Use precision, recall, F1, PR AUC, threshold tuning |
| Randomly split time-series data | Use time-based validation and backtesting |
| Ignore training-serving skew | Reuse transformation logic or centralize feature definitions |
| Train with personal credentials | Use dedicated service accounts |
| Store secrets in notebooks or containers | Use Secret Manager |
| Deploy model without monitoring | Add latency, errors, drift/skew, and quality monitoring |
| Fine-tune LLM to add changing facts | Use RAG/grounding for factual enterprise knowledge |
| Export BigQuery data unnecessarily | Use BigQuery ML or native integrations when suitable |
| Use online prediction for offline bulk scoring | Use batch prediction or warehouse scoring |
| Compare models on different data splits | Use consistent test data and logged experiments |
| Promote model based only on aggregate metric | Check slices, business cost, fairness, and operational constraints |
Last-minute checklist
Before answering a PMLE scenario question, identify:
- Task: classification, regression, forecasting, ranking, generation, clustering, anomaly detection.
- Data location: BigQuery, Cloud Storage, streaming, external source.
- Latency: online request, near-real-time stream, scheduled batch.
- Control level: AutoML, BigQuery ML, custom training, foundation model, pretrained API.
- Metric: aligned to business cost and class balance.
- Split: leakage-resistant and time-aware if needed.
- Pipeline: reproducible, versioned, and automated.
- Security: service accounts, least privilege, sensitive data controls.
- Deployment: endpoint, batch job, warehouse scoring, or custom serving.
- Monitoring: drift, skew, latency, errors, ground-truth performance.
- Rollback: model versioning, canary/shadow/blue-green where appropriate.
Practical next step
Use this Quick Reference to drill mixed PMLE scenarios. For every missed practice question, record the missed decision point: service choice, data split, metric, IAM boundary, deployment pattern, or monitoring control. Then redo similar questions until you can justify the Google Cloud design in one or two sentences.