You have created a Vertex Al pipeline that includes two steps. The first step preprocesses 10 TB data completes in about 1 hour, and saves the result in a Cloud Storage bucket The second step uses the processed data to train a model You need to update the model's code to allow you to test different algorithms You want to reduce pipeline execution time and cost, while also minimizing pipeline changes What should you do?

Question

MD Farmudin Safi · Accepted Answer

Enable caching for the pipeline job. and disable caching for the model training step.

MD Farmudin Safi · Answer

Add a pipeline parameter and an additional pipeline step Depending on the parameter value the pipeline step conducts or skips data preprocessing and starts model training.

MD Farmudin Safi · Answer

Create another pipeline without the preprocessing step, and hardcode the preprocessed Cloud Storage file location for model training.

MD Farmudin Safi · Answer

Configure a machine with more CPU and RAM from the compute-optimized machine family for the data preprocessing step.

Question list

List of questions

Question 1

(0)

Question 2

(0)

Question 3

(0)

Question 4

(0)

Question 5

(0)

Question 6

(0)

Question 7

(0)

Question 8

(0)

Question 9

(0)

Question 10

(0)

Related questions

Question 258 - Professional Machine Learning Engineer discussion

Suggested answer: D

0 comments