ExamGecko
Question list
Search
Search

List of questions

Search

Related questions











Question 129 - DP-100 discussion

Report
Export

You plan to use automated machine learning to train a regression model. You have data that has features which have missing values, and categorical features with few distinct values.

You need to configure automated machine learning to automatically impute missing values and encode categorical features as part of the training task.

Which parameter and value pair should you use in the AutoMLConfig class?

A.
featurization = 'auto'
Answers
A.
featurization = 'auto'
B.
enable_voting_ensemble = True
Answers
B.
enable_voting_ensemble = True
C.
task = 'classification'
Answers
C.
task = 'classification'
D.
exclude_nan_labels = True
Answers
D.
exclude_nan_labels = True
E.
enable_tf = True
Answers
E.
enable_tf = True
Suggested answer: A

Explanation:

Featurization str or FeaturizationConfig

Values: 'auto' / 'off' / FeaturizationConfig

Indicator for whether featurization step should be done automatically or not, or whether customized featurization should be used.

Column type is automatically detected. Based on the detected column type preprocessing/featurization is done as follows:

Categorical: Target encoding, one hot encoding, drop high cardinality categories, impute missing values.

Numeric: Impute missing values, cluster distance, weight of evidence.

DateTime: Several features such as day, seconds, minutes, hours etc.

Text: Bag of words, pre-trained Word embedding, text target encoding.

Reference:

https://docs.microsoft.com/en-us/python/api/azureml-train-automl-client/azureml.train.automl.automlconfig.automlconfig

asked 02/10/2024
Victor Armando Sánchez
37 questions
User
Your answer:
0 comments
Sorted by

Leave a comment first