An ecommerce company has developed a XGBoost model in Amazon SageMaker to predict whether a customer will return a purchased item. The dataset is imbalanced. Only 5% of customers return items
A data scientist must find the hyperparameters to capture as many instances of returned items as possible. The company has a small budget for compute.
How should the data scientist meet these requirements MOST cost-effectively?

Question

An ecommerce company has developed a XGBoost model in Amazon SageMaker to predict whether a customer will return a purchased item. The dataset is imbalanced. Only 5% of customers return items

A data scientist must find the hyperparameters to capture as many instances of returned items as possible. The company has a small budget for compute.

How should the data scientist meet these requirements MOST cost-effectively?

Tresor Garcia · Accepted Answer

Tune the csv_weight hyperparameter and the scale_pos_weight hyperparameter by using automatic model tuning (AMT). Optimize on {'HyperParameterTuningJobObjective': {'MetricName': 'validation:f1', 'Type': 'Maximize'}}.

Tresor Garcia · Answer

Tune all possible hyperparameters by using automatic model tuning (AMT). Optimize on {'HyperParameterTuningJobObjective': {'MetricName': 'validation:accuracy', 'Type': 'Maximize'}}

Tresor Garcia · Answer

Tune all possible hyperparameters by using automatic model tuning (AMT). Optimize on {'HyperParameterTuningJobObjective': {'MetricName': 'validation:f1', 'Type': 'Maximize'}}.

Tresor Garcia · Answer

Tune the csv_weight hyperparameter and the scale_pos_weight hyperparameter by using automatic model tuning (AMT). Optimize on {'HyperParameterTuningJobObjective': {'MetricName': 'validation:f1', 'Type': 'Minimize'}).

Question list

List of questions

Question 1

(0)

Question 2

(0)

Question 3

(0)

Question 4

(0)

Question 5

(0)

Question 6

(0)

Question 7

(0)

Question 8

(0)

Question 9

(0)

Question 10

(0)

Related questions

Question 257 - MLS-C01 discussion

Suggested answer: B

0 comments