ExamGecko
Question list
Search
Search

List of questions

Search

Related questions











Question 242 - MLS-C01 discussion

Report
Export

A company is planning a marketing campaign to promote a new product to existing customers. The company has data (or past promotions that are similar. The company decides to try an experiment to send a more expensive marketing package to a smaller number of customers. The company wants to target the marketing campaign to customers who are most likely to buy the new product. The experiment requires that at least 90% of the customers who are likely to purchase the new product receive the marketing materials.

...company trains a model by using the linear learner algorithm in Amazon SageMaker. The model has a recall score of 80% and a precision of 75%.

...should the company retrain the model to meet these requirements?

A.
Set the target_recall hyperparameter to 90% Set the binaryclassrfier model_selection_critena hyperparameter to recall_at_target_precision.
Answers
A.
Set the target_recall hyperparameter to 90% Set the binaryclassrfier model_selection_critena hyperparameter to recall_at_target_precision.
B.
Set the targetprecision hyperparameter to 90%. Set the binary classifier model selection criteria hyperparameter to precision at_jarget recall.
Answers
B.
Set the targetprecision hyperparameter to 90%. Set the binary classifier model selection criteria hyperparameter to precision at_jarget recall.
C.
Use 90% of the historical data for training Set the number of epochs to 20.
Answers
C.
Use 90% of the historical data for training Set the number of epochs to 20.
D.
Set the normalize_jabel hyperparameter to true. Set the number of classes to 2.
Answers
D.
Set the normalize_jabel hyperparameter to true. Set the number of classes to 2.
Suggested answer: A

Explanation:

The best way to retrain the model to meet the requirements is to set the target_recall hyperparameter to 90% and set the binary_classifier_model_selection_criteria hyperparameter to recall_at_target_precision. This will instruct the linear learner algorithm to optimize the model for a high recall score, while maintaining a reasonable precision score.Recall is the proportion of actual positives that were identified correctly, which is important for the company's goal of reaching at least 90% of the customers who are likely to buy the new product1.Precision is the proportion of positive identifications that were actually correct, which is also relevant for the company's budget and efficiency2.By setting the target_recall to 90%, the algorithm will try to achieve a recall score of at least 90%, and by setting the binary_classifier_model_selection_criteria to recall_at_target_precision, the algorithm will select the model that has the highest recall score among those that have a precision score equal to or higher than the target precision3.The target precision is automatically set to the median of the precision scores of all the models trained in parallel4.

The other options are not correct or optimal, because they have the following drawbacks:

B: Setting the target_precision hyperparameter to 90% and setting the binary_classifier_model_selection_criteria hyperparameter to precision_at_target_recall will optimize the model for a high precision score, while maintaining a reasonable recall score.However, this is not aligned with the company's goal of reaching at least 90% of the customers who are likely to buy the new product, as precision does not reflect how well the model identifies the actual positives1.Moreover, setting the target_precision to 90% might be too high and unrealistic for the dataset, as the current precision score is only 75%4.

C: Using 90% of the historical data for training and setting the number of epochs to 20 will not necessarily improve the recall score of the model, as it does not change the optimization objective or the model selection criteria.Moreover, using more data for training might reduce the amount of data available for validation, which is needed for selecting the best model among the ones trained in parallel3.The number of epochs is also not a decisive factor for the recall score, as it depends on the learning rate, the optimizer, and the convergence of the algorithm5.

D: Setting the normalize_label hyperparameter to true and setting the number of classes to 2 will not affect the recall score of the model, as these are irrelevant hyperparameters for binary classification problems.The normalize_label hyperparameter is only applicable for regression problems, as it controls whether the label is normalized to have zero mean and unit variance3.The number of classes hyperparameter is only applicable for multiclass classification problems, as it specifies the number of output classes3.

References:

1:Classification: Precision and Recall | Machine Learning | Google for Developers

2:Precision and recall - Wikipedia

3:Linear Learner Algorithm - Amazon SageMaker

4:How linear learner works - Amazon SageMaker

5:Getting hands-on with Amazon SageMaker Linear Learner - Pluralsight

asked 16/09/2024
Mario Jose Oliveros Recinos
42 questions
User
Your answer:
0 comments
Sorted by

Leave a comment first