DA0-001 Practice Test Questions

393 Questions


Angela is aggregating data from a CRM system with data from an employee system.

While performing an initial quality check, she realizes that her employee ID is not associated with her identifier in the CRM system.

What kind of issue is Angela facing?

Choose the best answer.


A. ETL process.


B. Record linkage.


C. ELT process.


D. System integration.





B.
  Record linkage.
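
To illustrate what record linkage involves, here is a minimal sketch that matches employee records to CRM records on a shared attribute. The records, the field names (`crm_id`, `employee_id`, `email`), and the choice of email as the match key are all assumptions made for this example; they do not come from the question.

```python
# Hypothetical records; every field name and value here is an illustrative assumption.
crm_records = [
    {"crm_id": "C-100", "email": "Angela@example.com"},
    {"crm_id": "C-101", "email": "sam@example.com"},
]
employee_records = [
    {"employee_id": "E-7", "email": " angela@example.com "},
    {"employee_id": "E-9", "email": "pat@example.com"},
]


def normalize(email):
    """Trim whitespace and lowercase so equivalent emails compare equal."""
    return email.strip().lower()


def link_records(crm, employees):
    """Return (linked pairs, unlinked employee IDs), using email as the match key."""
    crm_by_email = {normalize(r["email"]): r["crm_id"] for r in crm}
    linked, unlinked = [], []
    for emp in employees:
        crm_id = crm_by_email.get(normalize(emp["email"]))
        if crm_id is None:
            unlinked.append(emp["employee_id"])  # no CRM identifier found
        else:
            linked.append((emp["employee_id"], crm_id))
    return linked, unlinked


linked, unlinked = link_records(crm_records, employee_records)
print("linked:", linked)
print("unlinked:", unlinked)
```

Employee IDs that end up in `unlinked` are the kind of gap Angela found during her quality check.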

Which of the following values is the range (a measure of dispersion) of the scores of ten students in a test?

The scores of ten students in a test are 17, 23, 30, 36, 45, 51, 58, 66, 72, 77.


A. 90


B. 60


C. 70


D. 80





B.
  60
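
The range is the largest value minus the smallest value. A short check using the scores from the question:

```python
# Scores taken from the question text.
scores = [17, 23, 30, 36, 45, 51, 58, 66, 72, 77]

# Range = maximum - minimum = 77 - 17
score_range = max(scores) - min(scores)
print(score_range)  # 60
```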

Given the table below:

Which of the following boxes indicates that a Type II error has occurred?


A. 1


B. 2


C. 3


D. 4





C.
  3
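
The table is not reproduced here, so the sketch below only encodes the standard definitions: a Type I error is rejecting a true null hypothesis, and a Type II error is failing to reject a false one. The function name `classify_outcome` is a hypothetical helper, and the code does not say which numbered box holds which outcome.

```python
def classify_outcome(null_is_true, rejected_null):
    """Classify a hypothesis-test decision against the (normally unknown) truth."""
    if null_is_true and rejected_null:
        return "Type I error"   # false positive
    if not null_is_true and not rejected_null:
        return "Type II error"  # false negative
    return "Correct decision"


# Enumerate the four cells of the decision table.
for null_is_true in (True, False):
    for rejected_null in (True, False):
        print(null_is_true, rejected_null, classify_outcome(null_is_true, rejected_null))
```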

An analyst needs to conduct a quick analysis. Which of the following is the FIRST step the analyst should perform with the data?


A. Conduct an exploratory analysis and use descriptive statistics.


B. Conduct a trend analysis and use a scatter chart.


C. Conduct a link analysis and illustrate the connection points.


D. Conduct an initial analysis and use a Pareto chart.





A.
  Conduct an exploratory analysis and use descriptive statistics.
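
As a rough sketch of what a first exploratory pass can look like, the snippet below computes a few descriptive statistics with Python's standard `statistics` module. The `daily_orders` values are made up for illustration.

```python
import statistics

# Hypothetical data set; the values are invented for this example.
daily_orders = [12, 15, 15, 18, 20, 22, 95]

summary = {
    "count": len(daily_orders),
    "min": min(daily_orders),
    "max": max(daily_orders),
    "mean": statistics.mean(daily_orders),
    "median": statistics.median(daily_orders),
    "stdev": statistics.stdev(daily_orders),  # sample standard deviation
}

for name, value in summary.items():
    print(f"{name}: {value}")
```

A mean that sits well above the median, as it does here, is an early hint that an outlier deserves a closer look before any deeper analysis.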

For which of the following test statistics would a low value imply a potentially meaningful result?


A. Chi-squared


B. p-value


C. t-test


D. F-test





B.
  p-value
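
To show why a low p-value points to a potentially meaningful result, the sketch below computes a two-sided p-value for a z statistic using the standard normal distribution. The z-test setting and the function name `two_sided_p_value` are assumptions chosen for the example; the question does not tie the p-value to a particular test.

```python
from statistics import NormalDist


def two_sided_p_value(z):
    """Probability of a statistic at least as extreme as z under the null (standard normal)."""
    return 2 * (1 - NormalDist().cdf(abs(z)))


# A larger |z| means the data sit further from what the null predicts,
# so the p-value shrinks.
for z in (0.0, 1.0, 1.96, 3.0):
    print(z, round(two_sided_p_value(z), 4))
```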

The number of phone calls that the call center receives in a day is an example of:


A. continuous data.


B. categorical data.


C. ordinal data.


D. discrete data.





D.
  discrete data.

An analyst modified a data set that had a number of issues. Given the original and modified versions:

Which of the following data manipulation techniques did the analyst use?


A. Imputation


B. Recoding


C. Parsing


D. Deriving





B.
  Recoding
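
The original and modified data sets are not shown here, so the following is only a generic illustration of recoding: mapping inconsistent raw values onto a fixed set of codes. The values and the mapping are hypothetical.

```python
# Hypothetical raw responses with inconsistent spellings and casing.
raw_values = ["Y", "yes", "N", "No", " YES "]

# Mapping from cleaned-up raw values to the standard codes.
recode_map = {"y": "Yes", "yes": "Yes", "n": "No", "no": "No"}

# Anything not covered by the mapping is flagged as "Unknown" rather than guessed.
recoded = [recode_map.get(value.strip().lower(), "Unknown") for value in raw_values]
print(recoded)
```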

A healthcare data analyst notices that the BloodPressure column in a data set contains several outliers that need to be replaced with meaningful values. Which of the following data manipulation techniques should the analyst use?


A. Recode


B. Impute


C. Append


D. Reduction





B.
  Impute
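
One common way to impute is to replace implausible values with the median of the plausible ones. The sketch below does that; the readings and the plausibility bounds (`LOW`, `HIGH`) are assumptions for illustration, not clinical guidance.

```python
from statistics import median

# Hypothetical systolic readings; 0 and 999 stand in for outliers.
readings = [118, 122, 0, 130, 999, 125]

# Assumed plausibility bounds, chosen only for this example.
LOW, HIGH = 70, 250

# Compute the replacement value from the plausible readings only.
valid = [r for r in readings if LOW <= r <= HIGH]
fill_value = median(valid)

# Keep plausible readings; swap each outlier for the median.
imputed = [r if LOW <= r <= HIGH else fill_value for r in readings]
print(imputed)
```

The median is used instead of the mean because it is less sensitive to any extreme values that slip past the bounds.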

What category of data stewardship work is focused on ensuring that the organization respects the wishes of data subjects?


A. Data quality.


B. Data privacy.


C. Data security.


D. Regulatory compliance.





B.
  Data privacy.

Which of the following query optimization techniques involves examining only the data that is needed for a particular task?


A. Making a temporary table


B. Creating a flat file


C. Indexing documents


D. Creating an execution plan





C.
  Indexing documents
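
An index lets the database jump to the matching rows instead of scanning every row. The sketch below uses Python's built-in `sqlite3` module with a hypothetical `calls` table to show the query planner choosing an index; the table, column, and index names are assumptions for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE calls (id INTEGER PRIMARY KEY, agent_id INTEGER, duration_s INTEGER)"
)
conn.executemany(
    "INSERT INTO calls (agent_id, duration_s) VALUES (?, ?)",
    [(i % 50, 60 + i) for i in range(1000)],
)

# Index the column used in the WHERE clause.
conn.execute("CREATE INDEX idx_calls_agent ON calls (agent_id)")

# Ask SQLite how it would run the query; the last column holds the plan detail.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM calls WHERE agent_id = ?", (7,)
).fetchall()
plan_text = " ".join(row[-1] for row in plan)
print(plan_text)

matching_rows = conn.execute(
    "SELECT COUNT(*) FROM calls WHERE agent_id = ?", (7,)
).fetchone()[0]
print(matching_rows)
```

The plan text should mention `idx_calls_agent`, which indicates a search through the index and not a full table scan.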

An analyst has been asked to validate data quality. Which of the following are the BEST reasons to validate data for quality control purposes? (Choose two.)


A. Retention


B. Integrity


C. Transmission


D. Consistency


E. Encryption


F. Deletion





B.
  Integrity

D.
  Consistency

Which of the following describes the use of a representative amount of data from a main repository?


A. Observation


B. Delta load


C. Web scraping


D. Sampling





D.
  Sampling
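
A minimal sketch of simple random sampling with Python's standard `random` module follows. The population, the sample size, and the seed are arbitrary choices for illustration.

```python
import random

# Hypothetical repository of 10,000 record IDs.
population = list(range(1, 10001))

# A seeded generator keeps the example reproducible.
rng = random.Random(42)

# Draw 500 records without replacement.
sample = rng.sample(population, k=500)

print(len(sample))
print(sum(sample) / len(sample))  # sample mean, to compare with the population mean
```

Whether a sample of this size is representative depends on the data and the question being asked; stratified sampling is one alternative when subgroups matter.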

