Question 188 - Professional Machine Learning Engineer discussion


You have built a custom model that performs several memory-intensive preprocessing tasks before it makes a prediction. You deployed the model to a Vertex AI endpoint and validated that results were received in a reasonable amount of time. After routing user traffic to the endpoint, you discover that the endpoint does not autoscale as expected when receiving multiple requests. What should you do?

A. Use a machine type with more memory
B. Decrease the number of workers per machine
C. Increase the CPU utilization target in the autoscaling configurations
D. Decrease the CPU utilization target in the autoscaling configurations
Suggested answer: D

Explanation:

Vertex AI is a unified platform for machine learning development and deployment [1]. It lets you deploy models to endpoints for online prediction and configure the compute resources and autoscaling options for each deployed model [2]. By default, autoscaling of a Vertex AI endpoint is based on CPU utilization across all cores of the machine type you specified; the default target of 60% means 60% across all cores, so a 4-core machine needs 240% aggregate utilization before scaling is triggered [3]. Because the preprocessing in this model is memory-intensive rather than CPU-intensive, that CPU target is likely never reached even when many requests arrive, so the endpoint does not add replicas. Decreasing the CPU utilization target in the autoscaling configuration lowers the threshold for triggering autoscaling, so more replicas are allocated to handle the prediction requests. Option D is therefore the best choice for this use case; the other options do not address why scaling never triggers. A minimal deployment sketch is shown below, followed by the references.
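
For illustration, a minimal sketch of lowering the CPU target at deployment time with the Vertex AI Python SDK (google-cloud-aiplatform) might look like the following. The project, region, model ID, machine type, and the 50% target are placeholder values, and the autoscaling_target_cpu_utilization argument of Model.deploy is the setting assumed to carry the per-deployment CPU target discussed above.

    # Minimal sketch: deploy with a lower autoscaling CPU target (placeholder values)
    from google.cloud import aiplatform

    aiplatform.init(project="my-project", location="us-central1")  # placeholder project/region

    model = aiplatform.Model(model_name="1234567890")  # placeholder Vertex AI model ID

    endpoint = model.deploy(
        deployed_model_display_name="memory-intensive-model",
        machine_type="n1-highmem-4",   # placeholder machine type with extra memory headroom
        min_replica_count=1,
        max_replica_count=5,
        # Lower the CPU utilization target from the 60% default so new replicas are
        # added sooner, even though the workload is memory- rather than CPU-bound.
        autoscaling_target_cpu_utilization=50,
    )

A lower target trades some idle capacity for earlier scale-out, which is what this scenario needs; the equivalent setting can also be supplied when deploying through the Google Cloud console or the gcloud CLI.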

References:

[1] Vertex AI
[2] Deploy a model to an endpoint
[3] Vertex AI endpoint doesn't scale up / down

