Question 223 - Professional Data Engineer discussion
You are working on a linear regression model in BigQuery ML to predict a customer's likelihood of purchasing your company's products. Your model uses a city name variable as a key predictive component. To train and serve the model, your data must be organized in columns. You want to prepare your data using the least amount of coding while maintaining the predictive variables.
What should you do?
A.
Use SQL in BigQuery to transform the state column using a one-hot encoding method, and make each city a column with binary values.
B.
Create a new view with BigQuery that does not include a column with city information.
C.
Use Cloud Data Fusion to assign each city to a region that is labeled as 1, 2, 3, 4, or 5, and then use that number to represent the city in the model.
D.
Use TensorFlow to create a categorical variable with a vocabulary list. Create the vocabulary file and upload that as part of your model to BigQuery ML.
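
For readers unfamiliar with the term, here is a rough sketch of what the one-hot encoding mentioned in option A does to a categorical column. It is plain Python rather than BigQuery SQL, and the `one_hot_encode` helper, the `customers` sample rows, and the city names are all made up for illustration; it is not meant to show how BigQuery ML handles this internally.

```python
def one_hot_encode(rows, column):
    """Replace `column` with one binary column per distinct value found in it."""
    # Collect the distinct categories in a stable (sorted) order.
    categories = sorted({row[column] for row in rows})
    encoded = []
    for row in rows:
        # Keep every other column as-is and drop the original categorical one.
        new_row = {key: value for key, value in row.items() if key != column}
        for category in categories:
            # 1 if this row belongs to the category, otherwise 0.
            new_row[f"{column}_{category}"] = 1 if row[column] == category else 0
        encoded.append(new_row)
    return encoded


# Hypothetical sample data, not taken from the question.
customers = [
    {"customer_id": 1, "city": "Austin"},
    {"customer_id": 2, "city": "Boston"},
    {"customer_id": 3, "city": "Austin"},
]

encoded_customers = one_hot_encode(customers, "city")
for row in encoded_customers:
    print(row)
```

Each distinct city becomes its own 0/1 column, so no artificial ordering is imposed on the cities, in contrast to the numeric region labels described in option C.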