Free CompTIA Data+ DA0-002 Practice Questions: Data Analysis

Last revised: July 14, 2026

Practice 10 free CompTIA Data+ V2 (CompTIA Data+ DA0-002) questions on Data Analysis, with answers, explanations, and the IT Mastery next step.

Try the IT Mastery web app for a richer interactive practice experience with mixed sets, timed mocks, topic drills, explanations, and progress tracking.

Try CompTIA Data+ DA0-002 on Web

Topic snapshot

Field	Detail
Practice target	CompTIA Data+ DA0-002
Topic area	Data Analysis
Blueprint weight	24%
Page purpose	Focused sample questions before returning to mixed practice

How to use this topic drill

Use this page to isolate Data Analysis for CompTIA Data+ DA0-002. Work through the 10 questions first, then review the explanations and return to mixed practice in IT Mastery.

Pass	What to do	What to record
First attempt	Answer without checking the explanation first.	The fact, rule, calculation, or judgment point that controlled your answer.
Review	Read the explanation even when you were correct.	Why the best answer is stronger than the closest distractor.
Repair	Repeat only missed or uncertain items after a short break.	The pattern behind misses, not the answer letter.
Transfer	Return to mixed practice once the topic feels stable.	Whether the same skill holds up when the topic is no longer obvious.

Blueprint context: 24% of the practice outline. A focused topic score can overstate readiness if you recognize the pattern too quickly, so use it as repair work before timed mixed sets.

Sample questions

These are original IT Mastery practice questions aligned to this topic area. They are not official CompTIA questions, copied live-exam content, or exam dumps. Use them to preview question style and explanation depth before continuing with topic drills, mixed sets, and timed mocks in IT Mastery.

Question 1

Topic: Data Analysis

A data analyst owns a monthly revenue analysis that has passed prior validation. After the latest refresh, a manager reports that the dashboard total is 12% lower than the finance system. The source table is loaded nightly by another team, and the dashboard is due by noon. What is the best professional troubleshooting action before changing the analysis process?

Options:

A. Rebuild the analysis in a new tool
B. Change the revenue calculation to match finance
C. Compare refresh logs, row counts, and source totals
D. Manually adjust the dashboard total for noon

Best answer: C

Explanation: When a working analysis suddenly disagrees with a source of truth, the first professional step is evidence gathering. The analyst should verify whether the input changed, the refresh failed, rows were dropped, or source totals differ before modifying formulas or workflows. Compact checks such as refresh status, row counts, timestamps, null rates, and control totals help isolate whether the problem is in acquisition, preparation, calculation, or reporting. This preserves traceability and avoids introducing a new error into a validated process. The deadline matters, but it does not justify undocumented changes to make the output appear correct.

Changing the calculation assumes the analysis is wrong before confirming whether the source load or data changed.
Manual adjustment may satisfy the deadline superficially, but it breaks traceability and hides the root cause.
Rebuilding the tool is overengineering and does not target the immediate evidence needed to diagnose the discrepancy.

Question 2

Topic: Data Analysis

A data analyst is preparing a weekly sales exception report for regional managers. The source table has order_status, ship_date, promised_date, payment_flag, and vip_customer fields. Managers need each order labeled Review, Expedite, or OK based on documented business rules, and null ship_date values must be treated as unshipped rather than discarded. Which approach is the BEST professional decision?

Options:

A. Create a CASE or IF calculation using AND/OR rules
B. Build a predictive model to score late orders
C. Calculate average shipping delay by region
D. Remove rows with null ship_date before labeling

Best answer: A

Explanation: Rule-based categories are best handled with logical functions such as IF, CASE, AND, OR, and sometimes NOT. In this scenario, the analyst must assign labels from business rules and keep unshipped orders visible. A calculated field can evaluate conditions such as late shipment, missing payment, or VIP status and return the correct category without changing the source data. Null handling should be explicit so unshipped orders are not accidentally excluded from the report. The key is to translate documented decision rules into transparent logic that can be reviewed and maintained.

Averages miss categories because a regional delay metric does not label individual orders for action.
Predictive scoring overreaches because the requirement is documented rule application, not forecasting.
Dropping null shipments fails because null ship_date values represent unshipped orders that managers need to review.

Question 3

Topic: Data Analysis

An analyst is preparing an evidence-based conclusion about customer churn for an executive update. The executive asks whether the latest churn figure proves churn increased today.

Exhibit: Metric validation summary

Source	Refresh	Reliability	Preparation
CRM API pull	15 min	Not certified; 7% duplicate `customer_id`	None; includes test accounts
Analytics mart	Nightly at 2:00 AM	Certified source of truth	Deduplicated; test accounts removed

CRM API churn: 8.9%. Analytics mart churn: 7.4%, data current through yesterday.

Options:

A. Report 7.4% as proof churn did not increase today.
B. Report 7.4% through yesterday; today’s increase is not yet supported.
C. Report 8.9% because the CRM API is the freshest source.
D. Average both figures to balance freshness and reliability.

Best answer: B

Explanation: Evidence-based analysis conclusions should align the metric to the most reliable prepared source and the time period the data actually covers. In this case, the analytics mart is certified and applies the needed preparation steps, so it is the better source for the official churn figure. However, it refreshes nightly and is only current through yesterday. The fresher CRM API is not prepared and contains known duplicate and test-account issues, so its higher value should not be treated as official evidence of a same-day increase.

The key takeaway is that reliability and preparation can outweigh freshness, but refresh timing limits what conclusion can be made.

Fresh but unprepared fails because the CRM API has known duplicates and test accounts.
Averaging sources fails because combining an uncertified raw pull with a certified mart does not fix data-quality differences.
Overstating the mart fails because yesterday’s certified data cannot prove what happened today.

Question 4

Topic: Data Analysis

A sales manager reports that the regional conversion rate on the monthly dashboard is “wrong” after a refresh. The KPI data dictionary defines conversion rate as closed_won_opportunities / qualified_leads. Your validation shows the warehouse row counts match the CRM extract, the dashboard calculation matches the data dictionary, and the manager’s spreadsheet uses all leads instead of qualified leads. The manager needs an answer before a leadership meeting. Which action is the BEST professional decision?

Options:

A. Change the dashboard formula to use all leads immediately
B. Open a high-priority data pipeline defect ticket
C. Close the issue without responding to the manager
D. Classify it as a definition misunderstanding and share the validation evidence

Best answer: D

Explanation: When a user reports a report issue, first separate misunderstanding from a true defect by checking the source data, calculation logic, filters, and documented business definition. In this case, both the source validation and dashboard calculation match the approved KPI definition. The discrepancy comes from the manager using a different denominator in a spreadsheet. The best response is to explain the KPI definition, provide the validation evidence, and document any requested definition change as a separate requirements discussion. This prioritizes the user’s urgent need without creating an unnecessary defect or changing a governed metric incorrectly.

Formula change fails because it would alter a governed KPI without evidence that the approved definition changed.
Pipeline defect fails because the extract and warehouse counts already validated successfully.
No response fails because the manager has an urgent business need and deserves a documented explanation.

Question 5

Topic: Data Analysis

A data analyst is refreshing a CFO revenue variance dashboard due today. The approved source is the FinanceMart read-only warehouse, and local exports are not allowed because the dataset includes PII. A validation check shows the dashboard is still using last Friday’s extract.

Tool log:

Refresh failed: Cannot connect to FinanceMart
Error: authentication/endpoint timeout
Rows loaded: 0
Last successful refresh: Friday 18:00

Which action is the BEST professional decision?

Options:

A. Export the ERP data to a local spreadsheet
B. Verify the warehouse endpoint, VPN, and tool credentials
C. Publish the dashboard with the stale extract
D. Rewrite the revenue variance calculation

Best answer: B

Explanation: The issue is connectivity diagnosis, not analysis logic. The dashboard cannot reach the expected warehouse, loaded zero rows, and is still showing a stale extract. The best next step is to check the approved connection path: endpoint, VPN/network access, and the credentials or stored connection used by the BI tool. This keeps the report tied to the source of truth and respects the PII restriction against local exports. If access still fails after these checks, the analyst should escalate with the log evidence to the data platform or support team. Workarounds that use stale or locally copied data would compromise reporting quality or governance.

Local export violates the stated PII handling restriction and bypasses the approved warehouse source.
Stale extract fails the CFO reporting need because validation already identified the data as outdated.
Calculation rewrite does not address the logged authentication or endpoint timeout.

Question 6

Topic: Data Analysis

An analyst is troubleshooting a regional revenue query that runs without an error but returns no rows. Which issue is best supported by the exhibit?

Data dictionary excerpt:
customers.customer_id: C001, C002, ...
customers.region: North, South, ...
orders.order_id: O1001, O1002, ...
orders.customer_id: C001, C002, ...
orders.amount: numeric
orders.order_date: date

Query:
SELECT c.region, SUM(o.amount) AS revenue
FROM customers c
JOIN orders o
  ON c.customer_id = o.order_id
WHERE o.order_date >= '2025-01-01'
GROUP BY c.region;

Validation: orders has 1,248 rows on/after 2025-01-01.
Result: 0 rows

Options:

A. SUM(amount) uses the wrong aggregation function.
B. The JOIN matches customer_id to order_id.
C. region is missing from the GROUP BY clause.
D. The WHERE date filter excludes all orders.

Best answer: B

Explanation: This is a join-key issue. The query uses an inner join, so only rows with matching values on the join condition are kept. The data dictionary shows customers.customer_id and orders.customer_id share the same customer identifier pattern, while orders.order_id is a separate order identifier. Comparing C001-style customer IDs to O1001-style order IDs prevents valid customer-order matches, which explains the zero-row result. The validation note also shows the date filter should still leave many orders available. The key takeaway is to verify that joined columns represent the same business entity and compatible identifier domain.

Date filter fails because the validation note confirms many orders exist on or after the filtered date.
GROUP BY issue fails because region is already included in the GROUP BY clause.
Aggregation choice fails because SUM(amount) is appropriate for revenue and would not explain zero joined rows.

Question 7

Topic: Data Analysis

A data analyst is asked whether a 2-week email campaign increased repeat purchases. The dashboard shows a lift, but the analyst notices a recent data-quality alert.

Exhibit: Campaign evidence summary

Check	Result
Repeat purchase rate, targeted customers	8.4%
Repeat purchase rate, holdout customers	7.9%
Missing campaign flag	18% of orders
Missingness pattern	Mostly from mobile checkout
Data alert	Campaign flag pipeline changed 3 days before launch

Options:

A. Validate the campaign flag source and rerun the analysis
B. Impute missing campaign flags as not targeted
C. Exclude all mobile orders and publish the revised lift
D. Report that the campaign increased repeat purchases

Best answer: A

Explanation: Evidence-based conclusions should account for data quality before interpreting a KPI difference. Here, the observed lift is only 0.5 percentage points, while 18% of orders are missing the campaign flag. The missingness is not random because it is concentrated in mobile checkout, and a pipeline change occurred just before the campaign. That makes the campaign/holdout comparison unreliable until the source issue is investigated and the analysis is rerun with validated data. The defensible next action is to verify lineage and source extraction for the campaign flag rather than state a business conclusion from compromised evidence.

Positive conclusion fails because the lift may be an artifact of missing or misclassified campaign flags.
Dropping mobile orders may introduce selection bias and does not fix the underlying source-quality issue.
Default imputation is unsupported because treating missing flags as not targeted could systematically misclassify orders.

Question 8

Topic: Data Analysis

A marketing analyst is preparing a report for executives to decide whether to increase next quarter’s advertising budget. The current dataset shows total monthly sales and total monthly ad spend for the past six months, but it does not include campaign dates, seasonality, pricing changes, or competitor activity. Which approach should the analyst take before recommending a budget increase?

Options:

A. Collect relevant context data before drawing a conclusion
B. Recommend increasing the budget because sales rose
C. Use only a line chart to show the trend
D. Average the six months to smooth the data

Best answer: A

Explanation: Evidence-based conclusions require enough relevant data to support the decision being made. Sales and ad spend alone may show a relationship, but they do not rule out other explanations such as promotions, seasonal demand, price changes, or competitor actions. Because executives are deciding whether to spend more money, the analyst should first obtain additional context data and then reassess the relationship. A visual or summary statistic can describe the limited dataset, but it cannot make the evidence strong enough for the decision.

Trend-only reasoning fails because rising sales do not prove advertising caused the increase.
Chart formatting may improve communication, but it does not fix missing decision-critical evidence.
Averaging values can reduce noise, but it can also hide important variation and still omits key context.

Question 9

Topic: Data Analysis

A data analyst is preparing one executive slide about a pilot campaign. The business wants to know whether to expand the campaign. The source was an A/B purchase dataset; validation found random assignment, but 8% of transactions had a missing campaign flag and were excluded.

Group	n	Mean order value	95% CI for mean difference	p-value
Control	1,200	$48.20
Campaign	1,180	$52.10	-$0.80 to $8.60	0.10

Which conclusion is best supported by the summary statistics?

Options:

A. The campaign caused higher spending for all customer segments.
B. The campaign had no effect because the p-value is greater than 0.05.
C. The campaign shows a higher observed mean, but the increase is inconclusive and exclusions should be disclosed.
D. The campaign should be expanded because the mean order value is $3.90 higher.

Best answer: C

Explanation: Summary statistics support an observed difference, not necessarily a statistically confirmed business outcome. Here, the campaign group’s mean order value is $3.90 higher, but the 95% confidence interval for the mean difference ranges from -$0.80 to $8.60. Because that interval includes zero and the p-value is 0.10, the evidence does not meet a typical 0.05 significance threshold. The analyst should also disclose that records with missing campaign flags were excluded, since that data-quality issue affects interpretation. The best professional conclusion is cautious: report the observed lift, state that it is not statistically conclusive, and note the exclusion before recommending expansion.

Mean-only decision fails because an observed difference alone does not establish statistical support.
No-effect claim overstates the result; failing to reject a difference is not proof that no difference exists.
Causal segment claim goes beyond the summary statistics and the available data, especially with no segment-level evidence.

Question 10

Topic: Data Analysis

A support analytics team is updating a dashboard for the business objective: increase the share of billing questions resolved through self-service instead of agent contact.

Exhibit: Available dashboard measures

Measure	Definition
Help article views	Count of billing article page loads
Articles published	Count of new billing articles released
Self-service resolution rate	Billing help sessions with resolved feedback and no support ticket within 24 hours ÷ billing help sessions
Portal logins	Count of customer sign-ins to the help portal

Which measure is the best KPI for this objective?

Options:

A. Portal logins
B. Self-service resolution rate
C. Articles published
D. Help article views

Best answer: B

Explanation: A KPI should directly reflect the business outcome, not just activity that may or may not contribute to it. The stated objective is to increase the share of billing questions resolved through self-service instead of agent contact. A resolution-rate measure connects the self-service session to both successful resolution feedback and the absence of a follow-up support ticket, so it is aligned to the desired outcome. Counts such as views, articles, or logins can be useful supporting metrics, but they do not prove that customers solved billing issues without contacting support.

The key takeaway is to choose outcome-based KPIs when the business objective is outcome-based.

Article views can show usage, but high views may indicate confusion rather than successful self-service resolution.
Articles published tracks production activity, not whether customers resolved billing issues.
Portal logins measures access to the portal, but it does not show billing-question resolution or avoided tickets.

Continue in the web app

Use IT Mastery for interactive CompTIA Data+ DA0-002 practice with mixed sets, timed mocks, topic drills, explanations, and progress tracking.

Try CompTIA Data+ DA0-002 on Web

Data Acquisition and Preparation

Visualization and Reporting

Free CompTIA Data+ DA0-002 Practice Questions: Data Analysis

Topic snapshot

How to use this topic drill

Sample questions

Question 1

Question 2

Question 3

Question 4

Question 5

Question 6

Question 7

Question 8

Question 9

Question 10

Continue in the web app

Related focused pages

Browse Certification Practice Tests by Exam Family