Question 129 - Professional Machine Learning Engineer discussion

You work on a data science team at a bank and are creating an ML model to predict loan default risk. You have collected and cleaned hundreds of millions of records worth of training data in a BigQuery table, and you now want to develop and compare multiple models on this data using TensorFlow and Vertex AI. You want to minimize any bottlenecks during the data ingestion stage while considering scalability. What should you do?

A. Use the BigQuery client library to load data into a dataframe, and use tf.data.Dataset.from_tensor_slices() to read it.
B. Export data to CSV files in Cloud Storage, and use tf.data.TextLineDataset() to read them.
C. Convert the data into TFRecords, and use tf.data.TFRecordDataset() to read them.
D. Use TensorFlow I/O's BigQuery Reader to directly read the data.
Suggested answer: D

Explanation:

The best option for developing and comparing multiple models on a large-scale BigQuery table using TensorFlow and Vertex AI is to use TensorFlow I/O's BigQuery Reader to directly read the data. This option has the following advantages:

It minimizes bottlenecks during the data ingestion stage: the BigQuery Reader streams data from BigQuery to TensorFlow in parallel batches over multiple read streams, without first loading the entire table into memory or onto disk. It also supports column selection and row filtering at the source, reducing the need for additional preprocessing steps in TensorFlow.

It leverages the scalability and performance of BigQuery, as the BigQuery Reader can handle hundreds of millions of records worth of training data efficiently and reliably. BigQuery is a serverless, fully managed, and highly scalable data warehouse that can run complex queries over petabytes of data in seconds.

It simplifies the integration with Vertex AI, as the BigQuery Reader can be used with both custom and pre-built TensorFlow models on Vertex AI. Vertex AI is a unified platform for machine learning that provides various tools and features for data ingestion, data labeling, data preprocessing, model training, model tuning, model deployment, model monitoring, and model explainability.
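
For concreteness, here is a minimal sketch of this approach using the tensorflow-io package; the project, dataset, table, and column names are hypothetical placeholders:

import tensorflow as tf
from tensorflow_io.bigquery import BigQueryClient

PROJECT_ID = "my-project"          # hypothetical
DATASET_ID = "lending"             # hypothetical
TABLE_ID = "loan_training_data"    # hypothetical

client = BigQueryClient()
read_session = client.read_session(
    "projects/" + PROJECT_ID,
    PROJECT_ID,
    TABLE_ID,
    DATASET_ID,
    selected_fields=["income", "loan_amount", "defaulted"],  # hypothetical columns
    output_types=[tf.float64, tf.float64, tf.int64],
    requested_streams=4,  # read several streams in parallel
)

# parallel_read_rows() interleaves the streams into a single
# tf.data.Dataset of feature dictionaries: no export step and
# no in-memory copy of the table.
dataset = (
    read_session.parallel_read_rows()
    .map(lambda row: (
        {"income": row["income"], "loan_amount": row["loan_amount"]},
        row["defaulted"],
    ))
    .batch(1024)
    .prefetch(tf.data.AUTOTUNE)
)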

The other options are less optimal for the following reasons:

Option A: Using the BigQuery client library to load data into a dataframe, and using tf.data.Dataset.from_tensor_slices() to read it, introduces memory and performance issues. This option requires loading the entire BigQuery table into a Pandas dataframe, which can consume a lot of memory and cause out-of-memory errors. Moreover, tf.data.Dataset.from_tensor_slices() copies the dataframe's contents into TensorFlow tensors (and embeds them as constants in graph mode), so the data is effectively held in memory twice.
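
As a sketch of this failure mode, assuming the google-cloud-bigquery client library (the query and column names are hypothetical):

import tensorflow as tf
from google.cloud import bigquery

client = bigquery.Client()

# Materializes the ENTIRE result set as a pandas DataFrame in local
# memory; infeasible for hundreds of millions of rows.
df = client.query(
    "SELECT income, loan_amount, defaulted FROM lending.loan_training_data"
).to_dataframe()

# from_tensor_slices() then copies the DataFrame's contents into
# TensorFlow tensors, roughly doubling the memory footprint.
dataset = tf.data.Dataset.from_tensor_slices(
    (dict(df[["income", "loan_amount"]]), df["defaulted"].values)
)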

Option B: Exporting data to CSV files in Cloud Storage, and using tf.data.TextLineDataset() to read them, introduces additional steps and complexity. This option requires exporting the BigQuery table to one or more CSV files in Cloud Storage, which can take a long time and consume a lot of storage space. Moreover, using tf.data.TextLineDataset() to read the CSV files can be slow and error-prone, as it requires parsing and decoding each line of text, handling missing values and invalid data, and applying data transformations and validations.
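
For comparison, a sketch of the CSV route, assuming the table was already exported to sharded CSV files in Cloud Storage (the bucket path and columns are hypothetical):

import tensorflow as tf

CSV_PATTERN = "gs://my-bucket/loan-data/part-*.csv"  # hypothetical

def parse_line(line):
    # Every record must be decoded from text and validated at read time.
    income, loan_amount, defaulted = tf.io.decode_csv(
        line, record_defaults=[0.0, 0.0, 0]
    )
    return {"income": income, "loan_amount": loan_amount}, defaulted

files = tf.data.Dataset.list_files(CSV_PATTERN)
dataset = (
    files.interleave(
        lambda f: tf.data.TextLineDataset(f).skip(1),  # skip header row
        cycle_length=4,
    )
    .map(parse_line, num_parallel_calls=tf.data.AUTOTUNE)
    .batch(1024)
)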

Option C: Converting the data into TFRecords, and using tf.data.TFRecordDataset() to read them, likewise introduces additional steps and complexity. This option requires converting the BigQuery table into one or more TFRecord files, which are binary files that store serialized TensorFlow examples. This can take a long time and consume a lot of storage space. Moreover, using tf.data.TFRecordDataset() to read the TFRecord files requires defining and parsing the schema of the TensorFlow examples, which can be tedious and error-prone.
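
A sketch of the TFRecord route, assuming the table was already serialized to TFRecord files (for example by a Dataflow job); the paths and feature specification are hypothetical and must exactly match how the records were written:

import tensorflow as tf

# The schema must be redeclared by hand and kept in sync with the writer.
FEATURE_SPEC = {
    "income": tf.io.FixedLenFeature([], tf.float32),
    "loan_amount": tf.io.FixedLenFeature([], tf.float32),
    "defaulted": tf.io.FixedLenFeature([], tf.int64),
}

def parse_example(serialized):
    parsed = tf.io.parse_single_example(serialized, FEATURE_SPEC)
    label = parsed.pop("defaulted")
    return parsed, label

filenames = tf.io.gfile.glob("gs://my-bucket/loan-data/*.tfrecord")  # hypothetical
dataset = (
    tf.data.TFRecordDataset(filenames)
    .map(parse_example, num_parallel_calls=tf.data.AUTOTUNE)
    .batch(1024)
)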

