List of questions
Related questions
Question 247 - Professional Machine Learning Engineer discussion
While running a model training pipeline on Vertex Al, you discover that the evaluation step is failing because of an out-of-memory error. You are currently using TensorFlow Model Analysis (TFMA) with a standard Evaluator TensorFlow Extended (TFX) pipeline component for the evaluation step. You want to stabilize the pipeline without downgrading the evaluation quality while minimizing infrastructure overhead. What should you do?
A.
Add tfma.MetricsSpec () to limit the number of metrics in the evaluation step.
B.
Migrate your pipeline to Kubeflow hosted on Google Kubernetes Engine, and specify the appropriate node parameters for the evaluation step.
C.
Include the flag -runner=DataflowRunner in beam_pipeline_args to run the evaluation step on Dataflow.
D.
Move the evaluation step out of your pipeline and run it on custom Compute Engine VMs with sufficient memory.
Your answer:
0 comments
Sorted by
Leave a comment first