ExamGecko
Home Home / IIBA / CBDA
Question list
Search
Search

List of questions

Search

Related questions











Question 76 - CBDA discussion

Report
Export

An analytics team is sourcing data for a new analytics initiative and is deciding between two comparable data sources. One source being considered is a very large dataset and another consists of three smaller sources. What advantage will the larger dataset provide over the three smaller sources?

A.
More significant results
Answers
A.
More significant results
B.
Higher validity
Answers
B.
Higher validity
C.
More reproducibility
Answers
C.
More reproducibility
D.
Higher reliability
Answers
D.
Higher reliability
Suggested answer: A

Explanation:

Explanation: A larger dataset may provide more significant results than three smaller sources, as it may have more statistical power to detect differences or relationships among variables1.Statistical power is the probability of finding a statistically significant result when there is a true effect in the population2.A larger dataset may have more power because it may have more variability, less sampling error, and higher precision than smaller datasets3. More significant results may lead to more confident and valid conclusions and recommendations for the analytics initiative. Higher validity, more reproducibility, and higher reliability are not necessarily advantages of a larger dataset over three smaller sources, as they depend on other factors besides the size of the dat a.Validity is the degree to which the data and the analysis measure what they are intended to measure4. Reproducibility is the degree to which the data and the analysis can be replicated by another analyst using the same methods and data sources. Reliability is the degree to which the data and the analysis produce consistent results under the same conditions. These qualities may be affected by the quality, accuracy, completeness, and relevance of the data, as well as the appropriateness, transparency, and rigor of the analysis methods.A larger dataset may not be valid, reproducible, or reliable if it has errors, biases, missing values, or irrelevant variables, or if the analysis methods are not suitable, documented, or verified.

asked 18/09/2024
Harry Meijer
44 questions
User
Your answer:
0 comments
Sorted by

Leave a comment first