Try 12 Splunk O11y Cloud Certified Metrics User sample questions and practice-test preview prompts on metrics, dimensions, charts, dashboards, detectors, alerts, latency, and service-health interpretation.
Splunk O11y Cloud Certified Metrics User is an observability route for candidates who interpret metrics, dimensions, charts, dashboards, detectors, alerts, latency, and service-health signals in Splunk Observability Cloud.
Use this page to try original IT Mastery sample questions on observability decisions. They are not official Splunk exam questions.
Practice option: Sample questions available
Start with the 12 sample questions on this page. Dedicated practice for Splunk O11y Metrics User is not currently included as a full web-app practice page; enter your email to get updates when full practice becomes available or expands for this exam.
Need live practice now? See currently available IT Mastery exam pages.
Topic: metrics
A service shows a sharp increase in request latency but no increase in request count. What should the analyst check next?
Best answer: B
Explanation: Latency can rise without traffic growth. Dependencies, resource saturation, deployments, and errors can explain the change.
Topic: dimensions
Why are dimensions useful in metric analysis?
Best answer: D
Explanation: Dimensions provide context for slicing and grouping metrics. They help isolate where a problem is happening.
Topic: dashboards
A dashboard should help on-call engineers triage service health. What is the best design?
Best answer: A
Explanation: Operational dashboards should answer triage questions quickly. Key health signals and context help engineers decide where to investigate.
Topic: detectors
An alert fires every night because batch workload behavior is expected at that time. What should be tuned?
Best answer: C
Explanation: Detectors should distinguish expected patterns from abnormal behavior. Tuning reduces alert fatigue while preserving signal.
Topic: alert routing
A database saturation alert should notify the database operations team, not the frontend team. What should be configured?
Best answer: B
Explanation: Alerts must reach the team that can act. Ownership and severity should drive routing and escalation.
Topic: latency
Which metric view best helps identify whether latency affects all users or only one region?
Best answer: D
Explanation: Grouping by region can expose localized issues hidden by global averages. Dimensions make this comparison possible.
Topic: errors
A service’s error rate increases after a deployment. What is the best interpretation?
Best answer: A
Explanation: Timing correlation is useful but not final proof. Teams should confirm with multiple signals and deployment context.
Topic: saturation
CPU usage is normal, but queue depth and response time are rising. What does this suggest?
Best answer: C
Explanation: Saturation is not only CPU. Queues, I/O, worker pools, and dependencies can create delays even with normal CPU.
Topic: baselines
Why can static thresholds be weak for highly seasonal workloads?
Best answer: B
Explanation: Seasonal systems can have predictable peaks and troughs. Thresholds should account for expected variation to reduce noise.
Topic: service dependency
A frontend service is slow, but its own CPU and memory look normal. What should be checked?
Best answer: D
Explanation: User-facing latency may come from dependencies. Observability should connect frontend symptoms to downstream causes.
Topic: incident review
After an incident, why review detectors and dashboards?
Best answer: A
Explanation: Post-incident review should improve observability. Teams should ask whether the available signals supported timely action.
Topic: service-level indicators
Which signal best aligns with user experience for an API?
Best answer: C
Explanation: User-facing success and latency are closer to service experience than cosmetic or infrastructure-only measures.
| If you miss… | Drill this next |
|---|---|
| metrics questions | time series, dimensions, grouping, and aggregation |
| alert questions | detector thresholds, routing, severity, and noise reduction |
| triage questions | latency, errors, saturation, dependencies, and deployment context |
| dashboard questions | user-facing signals, service ownership, and incident workflow |