You have deployed a model on Vertex AI for real-time inference. During an online prediction request, you get an ''Out of Memory'' error. What should you do?

Question

Jesse Serrano · Accepted Answer

Send the request again with a smaller batch of instances.

Jesse Serrano · Answer

Use batch prediction mode instead of online mode.

Jesse Serrano · Answer

Use base64 to encode your data before using it for prediction.

Jesse Serrano · Answer

Apply for a quota increase for the number of prediction requests.

Question list

List of questions

Question 1

(0)

Question 2

(0)

Question 3

(0)

Question 4

(0)

Question 5

(0)

Question 6

(0)

Question 7

(0)

Question 8

(0)

Question 9

(0)

Question 10

(0)

Related questions

Question 93 - Professional Machine Learning Engineer discussion

Suggested answer: B

0 comments