Try 10 focused CompTIA Data+ DA0-002 questions on Data Analysis, with explanations, then continue with IT Mastery.
Open the matching IT Mastery practice page for timed mocks, topic drills, progress tracking, explanations, and full practice.
Try CompTIA Data+ DA0-002 on Web View full CompTIA Data+ DA0-002 practice page
| Field | Detail |
|---|---|
| Exam route | CompTIA Data+ DA0-002 |
| Topic area | Data Analysis |
| Blueprint weight | 24% |
| Page purpose | Focused sample questions before returning to mixed practice |
Use this page to isolate Data Analysis for CompTIA Data+ DA0-002. Work through the 10 questions first, then review the explanations and return to mixed practice in IT Mastery.
| Pass | What to do | What to record |
|---|---|---|
| First attempt | Answer without checking the explanation first. | The fact, rule, calculation, or judgment point that controlled your answer. |
| Review | Read the explanation even when you were correct. | Why the best answer is stronger than the closest distractor. |
| Repair | Repeat only missed or uncertain items after a short break. | The pattern behind misses, not the answer letter. |
| Transfer | Return to mixed practice once the topic feels stable. | Whether the same skill holds up when the topic is no longer obvious. |
Blueprint context: 24% of the practice outline. A focused topic score can overstate readiness if you recognize the pattern too quickly, so use it as repair work before timed mixed sets.
These original IT Mastery practice questions are aligned to this topic area. Use them for self-assessment, scope review, and deciding what to drill next.
Topic: Data Analysis
A data analyst owns a monthly revenue analysis that has passed prior validation. After the latest refresh, a manager reports that the dashboard total is 12% lower than the finance system. The source table is loaded nightly by another team, and the dashboard is due by noon. What is the best professional troubleshooting action before changing the analysis process?
Options:
A. Rebuild the analysis in a new tool
B. Change the revenue calculation to match finance
C. Compare refresh logs, row counts, and source totals
D. Manually adjust the dashboard total for noon
Best answer: C
Explanation: When a working analysis suddenly disagrees with a source of truth, the first professional step is evidence gathering. The analyst should verify whether the input changed, the refresh failed, rows were dropped, or source totals differ before modifying formulas or workflows. Compact checks such as refresh status, row counts, timestamps, null rates, and control totals help isolate whether the problem is in acquisition, preparation, calculation, or reporting. This preserves traceability and avoids introducing a new error into a validated process. The deadline matters, but it does not justify undocumented changes to make the output appear correct.
Topic: Data Analysis
A data analyst is preparing a weekly sales exception report for regional managers. The source table has order_status, ship_date, promised_date, payment_flag, and vip_customer fields. Managers need each order labeled Review, Expedite, or OK based on documented business rules, and null ship_date values must be treated as unshipped rather than discarded. Which approach is the BEST professional decision?
Options:
A. Create a CASE or IF calculation using AND/OR rules
B. Build a predictive model to score late orders
C. Calculate average shipping delay by region
D. Remove rows with null ship_date before labeling
Best answer: A
Explanation: Rule-based categories are best handled with logical functions such as IF, CASE, AND, OR, and sometimes NOT. In this scenario, the analyst must assign labels from business rules and keep unshipped orders visible. A calculated field can evaluate conditions such as late shipment, missing payment, or VIP status and return the correct category without changing the source data. Null handling should be explicit so unshipped orders are not accidentally excluded from the report. The key is to translate documented decision rules into transparent logic that can be reviewed and maintained.
ship_date values represent unshipped orders that managers need to review.Topic: Data Analysis
An analyst is preparing an evidence-based conclusion about customer churn for an executive update. The executive asks whether the latest churn figure proves churn increased today.
Exhibit: Metric validation summary
| Source | Refresh | Reliability | Preparation |
|---|---|---|---|
| CRM API pull | 15 min | Not certified; 7% duplicate customer_id | None; includes test accounts |
| Analytics mart | Nightly at 2:00 AM | Certified source of truth | Deduplicated; test accounts removed |
CRM API churn: 8.9%. Analytics mart churn: 7.4%, data current through yesterday.
Options:
A. Report 7.4% as proof churn did not increase today.
B. Report 7.4% through yesterday; today’s increase is not yet supported.
C. Report 8.9% because the CRM API is the freshest source.
D. Average both figures to balance freshness and reliability.
Best answer: B
Explanation: Evidence-based analysis conclusions should align the metric to the most reliable prepared source and the time period the data actually covers. In this case, the analytics mart is certified and applies the needed preparation steps, so it is the better source for the official churn figure. However, it refreshes nightly and is only current through yesterday. The fresher CRM API is not prepared and contains known duplicate and test-account issues, so its higher value should not be treated as official evidence of a same-day increase.
The key takeaway is that reliability and preparation can outweigh freshness, but refresh timing limits what conclusion can be made.
Topic: Data Analysis
A sales manager reports that the regional conversion rate on the monthly dashboard is “wrong” after a refresh. The KPI data dictionary defines conversion rate as closed_won_opportunities / qualified_leads. Your validation shows the warehouse row counts match the CRM extract, the dashboard calculation matches the data dictionary, and the manager’s spreadsheet uses all leads instead of qualified leads. The manager needs an answer before a leadership meeting. Which action is the BEST professional decision?
Options:
A. Change the dashboard formula to use all leads immediately
B. Open a high-priority data pipeline defect ticket
C. Close the issue without responding to the manager
D. Classify it as a definition misunderstanding and share the validation evidence
Best answer: D
Explanation: When a user reports a report issue, first separate misunderstanding from a true defect by checking the source data, calculation logic, filters, and documented business definition. In this case, both the source validation and dashboard calculation match the approved KPI definition. The discrepancy comes from the manager using a different denominator in a spreadsheet. The best response is to explain the KPI definition, provide the validation evidence, and document any requested definition change as a separate requirements discussion. This prioritizes the user’s urgent need without creating an unnecessary defect or changing a governed metric incorrectly.
Topic: Data Analysis
A data analyst is refreshing a CFO revenue variance dashboard due today. The approved source is the FinanceMart read-only warehouse, and local exports are not allowed because the dataset includes PII. A validation check shows the dashboard is still using last Friday’s extract.
Tool log:
Refresh failed: Cannot connect to FinanceMart
Error: authentication/endpoint timeout
Rows loaded: 0
Last successful refresh: Friday 18:00
Which action is the BEST professional decision?
Options:
A. Export the ERP data to a local spreadsheet
B. Verify the warehouse endpoint, VPN, and tool credentials
C. Publish the dashboard with the stale extract
D. Rewrite the revenue variance calculation
Best answer: B
Explanation: The issue is connectivity diagnosis, not analysis logic. The dashboard cannot reach the expected warehouse, loaded zero rows, and is still showing a stale extract. The best next step is to check the approved connection path: endpoint, VPN/network access, and the credentials or stored connection used by the BI tool. This keeps the report tied to the source of truth and respects the PII restriction against local exports. If access still fails after these checks, the analyst should escalate with the log evidence to the data platform or support team. Workarounds that use stale or locally copied data would compromise reporting quality or governance.
Topic: Data Analysis
An analyst is troubleshooting a regional revenue query that runs without an error but returns no rows. Which issue is best supported by the exhibit?
Data dictionary excerpt:
customers.customer_id: C001, C002, ...
customers.region: North, South, ...
orders.order_id: O1001, O1002, ...
orders.customer_id: C001, C002, ...
orders.amount: numeric
orders.order_date: date
Query:
SELECT c.region, SUM(o.amount) AS revenue
FROM customers c
JOIN orders o
ON c.customer_id = o.order_id
WHERE o.order_date >= '2025-01-01'
GROUP BY c.region;
Validation: orders has 1,248 rows on/after 2025-01-01.
Result: 0 rows
Options:
A. SUM(amount) uses the wrong aggregation function.
B. The JOIN matches customer_id to order_id.
C. region is missing from the GROUP BY clause.
D. The WHERE date filter excludes all orders.
Best answer: B
Explanation: This is a join-key issue. The query uses an inner join, so only rows with matching values on the join condition are kept. The data dictionary shows customers.customer_id and orders.customer_id share the same customer identifier pattern, while orders.order_id is a separate order identifier. Comparing C001-style customer IDs to O1001-style order IDs prevents valid customer-order matches, which explains the zero-row result. The validation note also shows the date filter should still leave many orders available. The key takeaway is to verify that joined columns represent the same business entity and compatible identifier domain.
region is already included in the GROUP BY clause.SUM(amount) is appropriate for revenue and would not explain zero joined rows.Topic: Data Analysis
A data analyst is asked whether a 2-week email campaign increased repeat purchases. The dashboard shows a lift, but the analyst notices a recent data-quality alert.
Exhibit: Campaign evidence summary
| Check | Result |
|---|---|
| Repeat purchase rate, targeted customers | 8.4% |
| Repeat purchase rate, holdout customers | 7.9% |
| Missing campaign flag | 18% of orders |
| Missingness pattern | Mostly from mobile checkout |
| Data alert | Campaign flag pipeline changed 3 days before launch |
Options:
A. Validate the campaign flag source and rerun the analysis
B. Impute missing campaign flags as not targeted
C. Exclude all mobile orders and publish the revised lift
D. Report that the campaign increased repeat purchases
Best answer: A
Explanation: Evidence-based conclusions should account for data quality before interpreting a KPI difference. Here, the observed lift is only 0.5 percentage points, while 18% of orders are missing the campaign flag. The missingness is not random because it is concentrated in mobile checkout, and a pipeline change occurred just before the campaign. That makes the campaign/holdout comparison unreliable until the source issue is investigated and the analysis is rerun with validated data. The defensible next action is to verify lineage and source extraction for the campaign flag rather than state a business conclusion from compromised evidence.
Topic: Data Analysis
A marketing analyst is preparing a report for executives to decide whether to increase next quarter’s advertising budget. The current dataset shows total monthly sales and total monthly ad spend for the past six months, but it does not include campaign dates, seasonality, pricing changes, or competitor activity. Which approach should the analyst take before recommending a budget increase?
Options:
A. Collect relevant context data before drawing a conclusion
B. Recommend increasing the budget because sales rose
C. Use only a line chart to show the trend
D. Average the six months to smooth the data
Best answer: A
Explanation: Evidence-based conclusions require enough relevant data to support the decision being made. Sales and ad spend alone may show a relationship, but they do not rule out other explanations such as promotions, seasonal demand, price changes, or competitor actions. Because executives are deciding whether to spend more money, the analyst should first obtain additional context data and then reassess the relationship. A visual or summary statistic can describe the limited dataset, but it cannot make the evidence strong enough for the decision.
Topic: Data Analysis
A data analyst is preparing one executive slide about a pilot campaign. The business wants to know whether to expand the campaign. The source was an A/B purchase dataset; validation found random assignment, but 8% of transactions had a missing campaign flag and were excluded.
| Group | n | Mean order value | 95% CI for mean difference | p-value |
|---|---|---|---|---|
| Control | 1,200 | $48.20 | ||
| Campaign | 1,180 | $52.10 | -$0.80 to $8.60 | 0.10 |
Which conclusion is best supported by the summary statistics?
Options:
A. The campaign caused higher spending for all customer segments.
B. The campaign had no effect because the p-value is greater than 0.05.
C. The campaign shows a higher observed mean, but the increase is inconclusive and exclusions should be disclosed.
D. The campaign should be expanded because the mean order value is $3.90 higher.
Best answer: C
Explanation: Summary statistics support an observed difference, not necessarily a statistically confirmed business outcome. Here, the campaign group’s mean order value is $3.90 higher, but the 95% confidence interval for the mean difference ranges from -$0.80 to $8.60. Because that interval includes zero and the p-value is 0.10, the evidence does not meet a typical 0.05 significance threshold. The analyst should also disclose that records with missing campaign flags were excluded, since that data-quality issue affects interpretation. The best professional conclusion is cautious: report the observed lift, state that it is not statistically conclusive, and note the exclusion before recommending expansion.
Topic: Data Analysis
A support analytics team is updating a dashboard for the business objective: increase the share of billing questions resolved through self-service instead of agent contact.
Exhibit: Available dashboard measures
| Measure | Definition |
|---|---|
| Help article views | Count of billing article page loads |
| Articles published | Count of new billing articles released |
| Self-service resolution rate | Billing help sessions with resolved feedback and no support ticket within 24 hours ÷ billing help sessions |
| Portal logins | Count of customer sign-ins to the help portal |
Which measure is the best KPI for this objective?
Options:
A. Portal logins
B. Self-service resolution rate
C. Articles published
D. Help article views
Best answer: B
Explanation: A KPI should directly reflect the business outcome, not just activity that may or may not contribute to it. The stated objective is to increase the share of billing questions resolved through self-service instead of agent contact. A resolution-rate measure connects the self-service session to both successful resolution feedback and the absence of a follow-up support ticket, so it is aligned to the desired outcome. Counts such as views, articles, or logins can be useful supporting metrics, but they do not prove that customers solved billing issues without contacting support.
The key takeaway is to choose outcome-based KPIs when the business objective is outcome-based.
Use the CompTIA Data+ DA0-002 Practice Test page for the full IT Mastery practice bank, mixed-topic practice, timed mock exams, explanations, and web/mobile app access.
Try CompTIA Data+ DA0-002 on Web View CompTIA Data+ DA0-002 Practice Test
Read the CompTIA Data+ DA0-002 Cheat Sheet on Tech Exam Lexicon, then return to IT Mastery for timed practice.