Question 151 - Professional Machine Learning Engineer discussion


You need to deploy a scikit-learn classification model to production. The model must serve requests 24/7, and you expect millions of requests per second to the production application from 8 am to 7 pm. You need to minimize the cost of deployment. What should you do?

A. Deploy an online Vertex AI prediction endpoint. Set the max replica count to 1.
B. Deploy an online Vertex AI prediction endpoint. Set the max replica count to 100.
C. Deploy an online Vertex AI prediction endpoint with one GPU per replica. Set the max replica count to 1.
D. Deploy an online Vertex AI prediction endpoint with one GPU per replica. Set the max replica count to 100.
Suggested answer: B

Explanation:

The best option for deploying a scikit-learn classification model to production is to deploy an online Vertex AI prediction endpoint and set the max replica count to 100. This lets you use the scalability of Google Cloud to serve requests 24/7 and absorb millions of requests per second at peak. Vertex AI is a unified platform for building and deploying machine learning solutions on Google Cloud, and it can deploy a trained scikit-learn model to an online prediction endpoint, which provides low-latency predictions for individual instances. An online prediction endpoint consists of one or more replicas: copies of the model running on virtual machines. The max replica count is the upper bound on autoscaling. With a max replica count of 100, the endpoint can scale out toward 100 replicas during the 8 am to 7 pm peak and scale back down to the minimum replica count overnight (online endpoints always keep at least one replica running, so they do not scale to zero). This minimizes the cost of deployment, because you pay only for the capacity the traffic actually requires, and you can further tune the autoscaling behavior based on latency and utilization metrics [1].
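As a rough illustration of the clamped autoscaling rule described above, the sketch below computes how many replicas a load would need, bounded by the min and max replica counts. The per-replica throughput figure is hypothetical and chosen only to make the arithmetic concrete; real capacity depends on the machine type and model.

```python
import math

def replicas_needed(rps, per_replica_rps, min_replicas=1, max_replicas=100):
    """Toy autoscaling rule: enough replicas to cover the request rate,
    clamped to the configured [min_replicas, max_replicas] range."""
    return max(min_replicas, min(max_replicas, math.ceil(rps / per_replica_rps)))

# Hypothetical capacity: each CPU replica serves 50,000 requests/second.
print(replicas_needed(2_000_000, 50_000))   # daytime load -> 40 replicas
print(replicas_needed(10_000_000, 50_000))  # heavier peak -> capped at 100
print(replicas_needed(1_000, 50_000))       # overnight -> floor of 1 replica
```

Note that the floor is 1, not 0, mirroring the fact that a deployed online endpoint never releases its last replica.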

The other options are not as good as option B, for the following reasons:

Option A: Deploying an online Vertex AI prediction endpoint with a max replica count of 1 cannot handle millions of requests per second. Capping the endpoint at a single replica means it cannot scale out when traffic rises, causing high latency, dropped requests, and service disruptions during the daily peak. A single replica is cheap, but an endpoint that cannot serve the required load fails the 24/7 serving requirement regardless of cost [1].

Option C: Deploying an online Vertex AI prediction endpoint with one GPU per replica and a max replica count of 1 combines both problems. A single replica cannot absorb the peak load, so the endpoint would suffer the same performance issues as option A. Attaching a GPU also raises the per-replica cost, since GPUs are more expensive than CPUs, without any benefit: scikit-learn models do not use GPUs, as scikit-learn is not optimized for GPU acceleration [2].

Option D: Deploying an online Vertex AI prediction endpoint with one GPU per replica and a max replica count of 100 could serve requests 24/7 and handle the peak load, but at unnecessary expense. The max replica count of 100 gives the same scaling behavior as option B, scaling out for the peak and back down to the minimum replica count off-peak. However, the GPU attached to every replica raises the cost of each replica-hour while providing no speedup, because scikit-learn is not optimized for GPU acceleration [2]. Paying for idle GPUs contradicts the requirement to minimize cost.
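The cost argument for autoscaling over a fixed fleet can be made concrete with a back-of-the-envelope calculation. The price per replica-hour below is hypothetical; the point is only the ratio between an always-on fleet of 100 replicas and a fleet that scales down to one replica outside the 8 am to 7 pm window.

```python
# Hypothetical price: $0.20 per replica-hour for a CPU machine type.
PRICE = 0.20
peak_hours, offpeak_hours = 11, 13  # 8 am - 7 pm peak vs. the rest of the day

fixed_100 = 100 * 24 * PRICE                                # always-on fleet
autoscaled = (100 * peak_hours + 1 * offpeak_hours) * PRICE  # scales to 1 off-peak

print(f"fixed 100 replicas: ${fixed_100:.2f}/day")  # $480.00/day
print(f"autoscaled 1-100:   ${autoscaled:.2f}/day") # $222.60/day
```

Even in this simplified model, autoscaling cuts the daily bill by more than half; adding a GPU surcharge to every replica-hour (option D) would only widen the gap against the CPU-only autoscaled deployment.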

Preparing for Google Cloud Certification: Machine Learning Engineer, Course 3: Production ML Systems, Week 2: Serving ML Predictions

Google Cloud Professional Machine Learning Engineer Exam Guide, Section 3: Scaling ML models in production, 3.1 Deploying ML models to production

Official Google Cloud Certified Professional Machine Learning Engineer Study Guide, Chapter 6: Production ML Systems, Section 6.2: Serving ML Predictions

Online prediction

Scaling online prediction

scikit-learn FAQ

asked 18/09/2024
Shafqat Balouch