List of questions
Related questions
Question 188 - Professional Machine Learning Engineer discussion
You have built a custom model that performs several memory-intensive preprocessing tasks before it makes a prediction. You deployed the model to a Vertex Al endpoint. and validated that results were received in a reasonable amount of time After routing user traffic to the endpoint, you discover that the endpoint does not autoscale as expected when receiving multiple requests What should you do?
A.
Use a machine type with more memory
B.
Decrease the number of workers per machine
C.
Increase the CPU utilization target in the autoscaling configurations
D.
Decrease the CPU utilization target in the autoscaling configurations
Your answer:
0 comments
Sorted by
Leave a comment first