You are building a model to predict daily temperatures. You split the data randomly and then transformed the training and test datasets. Temperature data for model training is uploaded hourly. During testing, your model performed with 97% accuracy; however, after deploying to production, the model's accuracy dropped to 66%. How can you make your production model more accurate?

Question

massamba gaye · Accepted Answer

Split the training and test data by time rather than randomly, to avoid leakage.
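A minimal sketch of what a time-based split could look like, assuming the readings arrive as `(timestamp, value)` pairs. With hourly data, a random split puts neighboring hours on both sides of the split, so the test set is not truly unseen; cutting at a point in time keeps every test reading later than every training reading. The helper names (`time_based_split`, `fit_scaler`, `scale`) and the synthetic data are hypothetical and only for illustration. The sketch also fits the normalization statistics on the training period alone and reuses them for the test period.

```python
from datetime import datetime, timedelta
import math
import random
from statistics import mean, stdev


def time_based_split(records, test_fraction=0.2):
    """Split (timestamp, value) records so the test set is the most recent slice."""
    ordered = sorted(records, key=lambda r: r[0])
    n_test = round(len(ordered) * test_fraction)
    cutoff = len(ordered) - n_test
    return ordered[:cutoff], ordered[cutoff:]


def fit_scaler(values):
    """Return (mean, standard deviation) computed from the given values only."""
    return mean(values), stdev(values)


def scale(values, mu, sigma):
    """Standardize values with previously fitted statistics."""
    return [(v - mu) / sigma for v in values]


# Hypothetical data: 30 days of synthetic hourly temperature readings.
random.seed(0)
start = datetime(2024, 1, 1)
records = [
    (start + timedelta(hours=h),
     10 + 5 * math.sin(2 * math.pi * h / 24) + random.gauss(0, 1))
    for h in range(24 * 30)
]
random.shuffle(records)  # arrival order does not matter; the split sorts by time

train, test = time_based_split(records, test_fraction=0.2)

# Fit the statistics on the training period only, then apply them to both sets.
mu, sigma = fit_scaler([v for _, v in train])
train_scaled = scale([v for _, v in train], mu, sigma)
test_scaled = scale([v for _, v in test], mu, sigma)

# The latest training timestamp precedes the earliest test timestamp.
print(train[-1][0], "<", test[0][0])
```

Evaluating on the most recent slice mirrors production, where the model only ever sees data newer than anything it was trained on.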

massamba gaye · Answer

Normalize the data for the training and test datasets as two separate steps.

massamba gaye · Answer

Add more data to your test set to ensure that you have a fair distribution and sample for testing.

massamba gaye · Answer

Apply data transformations before splitting, and cross-validate to make sure that the transformations are applied to both the training and test sets.
