Given the following grocery store orders:

If a query is made to the table with the following logic:
Order_Total > 132 OR (Order Total >= 25 AND Order_Total < 74)
Which of the following is the number of orders that will be returned by the query?
A. Four
B. Five
C. Six
D. Seven
A data analyst has been asked to derive a new variable labeled “Promotion_flag” based on
the total quantity sold by each salesperson. Given the table below:

Which of the following functions would the analyst consider appropriate to flag “Yes” for
every salesperson who has a number above 1,000,000 in the Quantity_sold column?
A. Date
B. Mathematical
C. Logical
D. Aggregate
An organization would like to add a secondary email field to its customer database in order toenrich the customer profiles. Which of the following data manipulation techniques should the analyst use to add this information?
A. Blend
B. Merge
C. Append
D. Aggregate
A data engineer is creating a database field to capture whether a customer likes vanilla ice cream. Which of the following data types is the best to capture this information?
A. Integer
B. Boolean
C. Categorical
D. Numeric
Which of the following describes the method of sampling in which elements of data are selected randomly from each of the small subgroups within a population?
A. Simple random
B. Cluster
C. Systematic
D. Stratified
A sales manager requested a report that contains the first name, last name, and phone number of all the company’s customers and employees. The data engineer needs to return all the records from several tables, even duplicates. Which of the following is the best way to join the two tables?
A. FULL OUTER JOIN
B. INNER JOIN
C. LEFT OUTER JOIN
D. CROSS JOIN
Which of the following data types best describe 4Ac1? (Select two).
A. Alphanumeric
B. Symbolic
C. Numeric
D. Float
E. Boolean
F. String
Which of the following is the most appropriate to consider when creating a schema of a central group broken into detailed subcategories?
A. Relational
B. Hierarchical
C. Snowflake
D. Star
What role in a data governance is typically responsible for day-to-day oversight of data use?
A. Data processors.
B. Data custodians
C. Data owners.
D. Data stewards.
Which of the following would be considered non-personally identifiable information?
A. Cell phone device name
B. Customer’s name
C. Government ID number
D. Telephone number
A financial institution is reporting on sales performance to a company at the account level. Due to the sensitive nature of the government the does il with, some account information is not shown. Which of the following fields should be masked?
A. Sales volume
B. Start date
C. Product name
D. Customer name
Which of the following programming languages are best suited for analysis and machine learning applications? (Select two).
A. Ruby
B. Rust
C. PHP
D. Python
E. Kotlin
F. R
| Page 2 out of 33 Pages |
| Previous |