You have a demand forecasting pipeline in production that uses Dataflow to preprocess raw data prior to model training and prediction. During preprocessing, you employ Z-score normalization on data stored in BigQuery and write it back to BigQuery. New training data is added every week. You want to make the process more efficient by minimizing computation time and manual intervention. What should you do?

Question

Jeffrey Sammaritano · Accepted Answer

Translate the normalization algorithm into SQL for use with BigQuery

Jeffrey Sammaritano · Answer

Normalize the data using Google Kubernetes Engine

Jeffrey Sammaritano · Answer

Use the normalizer_fn argument in TensorFlow's Feature Column API

Jeffrey Sammaritano · Answer

Normalize the data with Apache Spark using the Dataproc connector for BigQuery

Question list

List of questions

Question 1

(0)

Question 2

(0)

Question 3

(0)

Question 4

(0)

Question 5

(0)

Question 6

(0)

Question 7

(0)

Question 8

(0)

Question 9

(0)

Question 10

(0)

Related questions

Question 30 - Professional Machine Learning Engineer discussion

Suggested answer: B

0 comments