You are developing an ML model using a dataset with categorical input variables. You have randomly split half of the data into training and test sets. After applying one-hot encoding on the categorical variables in the training set, you discover that one categorical variable is missing from the test set. What should you do?

Question

Jesus De Leon Luis · Accepted Answer

Apply one-hot encoding on the categorical variables in the test data.

Jesus De Leon Luis · Answer

Randomly redistribute the data, with 70% for the training set and 30% for the test set

Jesus De Leon Luis · Answer

Use sparse representation in the test set

Jesus De Leon Luis · Answer

Collect more data representing all categories

Question list

List of questions

Question 1

(0)

Question 2

(0)

Question 3

(0)

Question 4

(0)

Question 5

(0)

Question 6

(0)

Question 7

(0)

Question 8

(0)

Question 9

(0)

Question 10

(0)

Related questions

Question 139 - Professional Machine Learning Engineer discussion

Suggested answer: C

0 comments