ExamGecko
Question list
Search
Search

List of questions

Search

Related questions











Question 270 - Professional Machine Learning Engineer discussion

Report
Export

You have developed an AutoML tabular classification model that identifies high-value customers who interact with your organization's website.

You plan to deploy the model to a new Vertex Al endpoint that will integrate with your website application. You expect higher traffic to the website during nights and weekends. You need to configure the model endpoint's deployment settings to minimize latency and cost. What should you do?

A.
Configure the model deployment settings to use an n1-standard-32 machine type.
Answers
A.
Configure the model deployment settings to use an n1-standard-32 machine type.
B.
Configure the model deployment settings to use an n1-standard-4 machine type. Set the minReplicaCount value to 1 and the maxReplicaCount value to 8.
Answers
B.
Configure the model deployment settings to use an n1-standard-4 machine type. Set the minReplicaCount value to 1 and the maxReplicaCount value to 8.
C.
Configure the model deployment settings to use an n1-standard-4 machine type and a GPU accelerator. Set the minReplicaCount value to 1 and the maxReplicaCount value to 4.
Answers
C.
Configure the model deployment settings to use an n1-standard-4 machine type and a GPU accelerator. Set the minReplicaCount value to 1 and the maxReplicaCount value to 4.
D.
Configure the model deployment settings to use an n1-standard-8 machine type and a GPU accelerator.
Answers
D.
Configure the model deployment settings to use an n1-standard-8 machine type and a GPU accelerator.
Suggested answer: B

Explanation:

Deploying a model to an endpoint in Vertex AI associates physical resources with the model so it can serve online predictions with low latency1.By configuring the model deployment settings to use an n1-standard-4 machine type and setting the minReplicaCount value to 1 and the maxReplicaCount value to 8, you can ensure that the model scales according to the traffic, thereby minimizing latency and cost1.The n1-standard-4 machine type provides a balance between computing power and cost, and the dynamic scaling allows the model to handle higher traffic during nights and weekends without incurring unnecessary costs during off-peak times

asked 18/09/2024
Fathy Refay
42 questions
User
Your answer:
0 comments
Sorted by

Leave a comment first