A Data Scientist is developing a binary classifier to predict whether a patient has a particular disease on a series of test results. The Data Scientist has data on 400 patients randomly selected from the population. The disease is seen in 3% of the population.
Which cross-validation strategy should the Data Scientist adopt?

Question

A Data Scientist is developing a binary classifier to predict whether a patient has a particular disease on a series of test results. The Data Scientist has data on 400 patients randomly selected from the population. The disease is seen in 3% of the population.

Which cross-validation strategy should the Data Scientist adopt?

Rio Ordonez · Accepted Answer

A stratified k-fold cross-validation strategy with k=5

Rio Ordonez · Answer

A k-fold cross-validation strategy with k=5

Rio Ordonez · Answer

A k-fold cross-validation strategy with k=5 and 3 repeats

Rio Ordonez · Answer

An 80/20 stratified split between training and validation

Question list

List of questions

Question 1

(0)

Question 2

(0)

Question 3

(0)

Question 4

(0)

Question 5

(0)

Question 6

(0)

Question 7

(0)

Question 8

(0)

Question 9

(0)

Question 10

(0)

Related questions

Question 174 - MLS-C01 discussion

Suggested answer: B

0 comments