ExamGecko
Question list
Search
Search

List of questions

Search

Related questions











Question 34 - Professional Machine Learning Engineer discussion

Report
Export

You are an ML engineer at a global car manufacturer. You need to build an ML model to predict car sales in different cities around the world. Which features or feature crosses should you use to train city-specific relationships between car type and number of sales?

A.
Three individual features binned latitude, binned longitude, and one-hot encoded car type
Answers
A.
Three individual features binned latitude, binned longitude, and one-hot encoded car type
B.
One feature obtained as an element-wise product between latitude, longitude, and car type
Answers
B.
One feature obtained as an element-wise product between latitude, longitude, and car type
C.
One feature obtained as an element-wise product between binned latitude, binned longitude, and one-hot encoded car type
Answers
C.
One feature obtained as an element-wise product between binned latitude, binned longitude, and one-hot encoded car type
D.
Two feature crosses as a element-wise product the first between binned latitude and one-hot encoded car type, and the second between binned longitude and one-hot encoded car type
Answers
D.
Two feature crosses as a element-wise product the first between binned latitude and one-hot encoded car type, and the second between binned longitude and one-hot encoded car type
Suggested answer: C

Explanation:

A feature cross is a synthetic feature that is obtained by combining two or more existing features, usually by taking their product or concatenation. A feature cross can help to capture the nonlinear and interaction effects between the original features, and improve the predictive performance of the model.A feature cross can be applied to different types of features, such as numeric, categorical, or geospatial features1.

For the use case of building an ML model to predict car sales in different cities around the world, the best option is to use one feature obtained as an element-wise product between binned latitude, binned longitude, and one-hot encoded car type. This option involves creating a feature cross that combines three individual features: binned latitude, binned longitude, and one-hot encoded car type. Binning is a technique that transforms a continuous numeric feature into a discrete categorical feature by dividing its range into equal intervals, or bins. One-hot encoding is a technique that transforms a categorical feature into a binary vector, where each element corresponds to a possible category, and has a value of 1 if the feature belongs to that category, and 0 otherwise. By applying binning and one-hot encoding to the latitude, longitude, and car type features, the feature cross can capture the city-specific relationships between car type and number of sales, as each combination of bins and car types can represent a different city and its preference for a certain car type. For example, the feature cross can learn that a city with a latitude bin of [40, 50], a longitude bin of [-80, -70], and a car type of SUV has a higher number of sales than a city with a latitude bin of [-10, 0], a longitude bin of [10, 20], and a car type of sedan. Therefore, using one feature obtained as an element-wise product between binned latitude, binned longitude, and one-hot encoded car type is the best option for this use case.

Feature Crosses | Machine Learning Crash Course

asked 18/09/2024
Benjamin Colart
47 questions
User
Your answer:
0 comments
Sorted by

Leave a comment first