DELL D-DS-FN-23 Practice Test - Questions Answers

List of questions
Question 1

When should you consider using multinomial logistic regression over binary logistic regression?
Dependent variable is continuous or dichotomous
Dependent variable is continuous or categorical
Dependent variable has more than two categories
Dependent variable is continuous only
Question 2

In which programming language is Hadoop written?
C++
Scala
Java
Python
Question 3

What is a key consideration when preparing a presentation intended for analysts?
Describe how to implement the model
Provide talking points to promote or evangelize the project
Emphasize the business benefits of implementing the model
Focus on clean simple-to-understand visuals
Question 4

A logistic regression model is built to determine the probability of a credit card borrower defaulting on a credit loan. A threshold value of 0.3 is selected. Which statement can be used to predict a borrower will default?
If probability > 0.1, then predict the borrower will default
If probability < 0.1, then predict the borrower will default
If probability > 0.3, then predict the borrower will default
If probability < 0.3, then predict the borrower will default
Question 5

What are the two data categories that represent qualitative data?
Ordinal and interval
Nominal and ordinal
Ratio and interval
Nominal and ratio
Question 6

In hypothesis testing, when does a Type I error occur?
Null hypothesis is rejected when it is actually false
Null hypothesis is rejected when it is actually true
Null hypothesis is accepted when it is actually false
Null hypothesis is accepted when it is actually true
Question 7

You have been given a task to improve sales force compensation of your organization. As a result of a study, your team decides to classify personnel as follows:
Did not meet quota
Met quota
Exceeded 150% of quota
In which data analytics lifecycle phase should you define these categories for analysis purposes?
Model building
Communicate results
Operationalize
Model planning
Question 8

A decision tree is being built. An internal node is being evaluated for partitioning on variables A and B. The entropy of the internal node is 0.8. The entropy for each of the variables is as follows:
Variable A: 0.5
Variable B: 0.4
Which variable will be used to partition the data and what is the information gain?
Variable B; information gain is 0.1
Variable B; information gain is 0.4
Variable A; information gain is 0.1
Variable A; information gain is 0.5
Question 9

In association rules, given items X and Y, what does lift measure?
Percentage of transactions that contain an itemset with X
Percentage of transactions with Xthat also contain Y
Difference in the probability ofX and Y appearing together compared with expectations as if they were statistically independent
How many times more often X and Y occur together than expected if they were statistically independent, expressed as a ratio
Question 10

What are categorized as cluster and workflow management tools for Hadoop?
Flume, Sqoop, and Storm
Drill, Hive, and HBase
Spark, Tez, and Cassandra
Ambari, Oozie, and Zookeeper
Question