ExamGecko
Question list
Search
Search

List of questions

Search

Question 85 - DA0-001 discussion

Report
Export

The ACME Corporation hired an analyst to detect data quality issues in their Excel documents. Which of the following are the most common issues? (Select TWO)

A.
Apostrophe.
Answers
A.
Apostrophe.
B.
Commas.
Answers
B.
Commas.
C.
Symbols.
Answers
C.
Symbols.
D.
Duplicates.
Answers
D.
Duplicates.
E.
Misspellings.
Answers
E.
Misspellings.
Suggested answer: D, E

Explanation:

1. Duplicates

2. Misspellings

The most common data quality issues are difficult to resolve in Excel because of their rigidity. It forces analysts to do a ton of manual work, which results in a high probability of an error being introduced to the data set. Those common issues include:

- Blanks

- Nulls

- Outliers

- Duplicates

- Extra spaces

- Misspellings

- Abbreviations and domain-specific variations

- Formula error codes

When introduced, these errors can skew or even invalidate the resulting analysis. A smart tool would minimize the possibility of error by automating the manual work. In Excel, you might look for data quality issues in one of two ways. First, you might use auto filters on specific columns to scan for anomalies and blanks or you might use a pivot table to find gaps and discrepancies.

In either case, you're scanning for the anomalies yourself. Suffice it to say that's not a very efficient process. It also means accuracy is only as good as the analyst's eye, so the probability of error varies throughout the day.

asked 02/10/2024
Santanu Roy
34 questions
User
Your answer:
0 comments
Sorted by

Leave a comment first