A data analyst is helping a retail store categorize its customers into five different groups
based on the following information:
• How recently the customers made purchases
• How frequently the customers made purchases
• How much the customers spent
Given the following information:

Which of the following would be most important for the analysis?
A. CustomerJD. Channel, Order_Date
B. CustomerJD, Territory. Amount
C. CustomerJD, Order_Date. Amount
D. CustomerJD. Quantity, Amount
Which of the following is an example of a discrete variable?
A. The temperature of a hot tub
B. The height of a horse
C. The time to complete a task
D. The number of people in an office
A data analyst was asked to create a visual representation of sales for the first quarter of 2020. Which of the following visualizations should be used when a time element is present?
A. A bubble chart
B. A line chart
C. A scatter plot
D. An infographic
Alex wants to use data from his corporate sale, CRM, and shipping systems to try and
predict future sales.
Which of the following systems is the most appropriate?
Choose the best answer.
A. Data mart.
B. OLAP.
C. Data Warehouse.
D. OLTP.
A client has requested an analysis of all pet care items purchased by current customers and their social media connections in the past 12 months. Which of the following data analysis techniques would be the best choice given these requirements?
A. Trend analysis
B. Performance analysis
C. Link analysis
D. Exploratory data analysis
Which of the following differentiates a flat text file from other data types?
A. Data is separated by a delimiter.
B. Data is stored in defined rows.
C. Data is defined with key-value pairs.
D. Data is housed in a markup language.
Which of the following is a common data analytics tool that is also used as an interpreted, high-level, general-purpose programming language?
A. SAS
B. Microsoft Power BI
C. IBM SPSS
D. Python
An analyst is updating a customer contacts database with information obtained from a survey of new customers. Which of the following data manipulation techniques should the analyst use?
A. Join
B. Append
C. Transform
D. Blend
A data analyst must separate the column shown below into multiple columns for each
component of the name:

Which of the following data manipulation techniques should the analyst perform?
A. Imputing
B. Transposing
C. Parsing
D. Concatenating
Consider the following dataset which contains information about houses that are for sale:

Which of the following string manipulation commands will combine the address and region
name columns to create a full address?
full_address------------------------- 85 Turner St, Northern Metropolitan 25 Bloomburg St,
Northern Metropolitan 5 Charles St, Northern Metropolitan 40 Federation La, Northern
Metropolitan 55a Park St, Northern Metropolitan
A. SELECT CONCAT(address, ' , ' , regionname) AS full_address FROM melb LIMIT 5;
B. SELECT CONCAT(address, '-' , regionname) AS full_address FROM melb LIMIT 5;
C. SELECT CONCAT(regionname, ' , ' , address) AS full_address FROM melb LIMIT 5
D. SELECT CONCAT(regionname, '-' , address) AS full_address FROM melb LIMIT 5;
A data analyst needs to create a master file that includes customer information from the
tables below:

Given the three tables above, the analyst wants to filter down the information prior to
joining it together. In which of the following orders should this data manipulation bo
approached for the most efficient result?
A. Merge, append, deduplicate
B. Merge, deduplicate, append
C. Deduplicate, append, merge
D. Append, deduplicate, merge
You are working with a dataset and want to change the names of categories that you used
fordifferent types of books.
What term best describes this action?
A. Recording.
B. Summarizing
C. Aggregating.
D. Filtering.
| Page 13 out of 33 Pages |
| Previous |