ExamGecko
Question 24 - HPE2-N69 discussion


A trial is running on a GPU slot within a resource pool on HPE Machine Learning Development Environment. That GPU fails. What happens next?

A. The trial fails, and the ML engineer must restart it manually by re-running the experiment.
B. The conductor reschedules the trial on another available GPU in the pool, and the trial restarts from the state of the latest training workload.
C. The conductor reschedules the trial on another available GPU in the pool, and the trial restarts from the latest checkpoint.
D. The trial fails, and the ML engineer must manually restart it from the latest checkpoint using the WebUI.
Suggested answer: C
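
To make the behavior in option C concrete, here is a minimal toy simulation of "reschedule on another slot and resume from the latest checkpoint". It is only a sketch and does not use the HPE Machine Learning Development Environment API; the names `ToyPool`, `GpuFailure`, `run_trial`, and `schedule`, as well as the step counts, are hypothetical and chosen for illustration.

```python
from dataclasses import dataclass, field
from typing import Optional


class GpuFailure(Exception):
    """Raised by the toy trial to simulate a failing GPU slot."""


@dataclass
class ToyPool:
    # Hypothetical stand-in for a resource pool: slot names plus shared trial state.
    slots: list
    latest_checkpoint: Optional[int] = None  # last step that was checkpointed
    log: list = field(default_factory=list)  # (slot, resumed_from_step, outcome)


def run_trial(pool, slot, total_steps, checkpoint_every, failing_slots, fail_at):
    """Train from the latest checkpoint, saving a checkpoint every N steps."""
    start = pool.latest_checkpoint or 0
    for step in range(start + 1, total_steps + 1):
        if slot in failing_slots and step == fail_at:
            raise GpuFailure(slot)  # work since the last checkpoint is lost
        if step % checkpoint_every == 0:
            pool.latest_checkpoint = step
    return total_steps


def schedule(pool, total_steps, checkpoint_every, failing_slots, fail_at):
    """Toy 'conductor': on failure, retry on the next slot without user action."""
    for slot in pool.slots:
        resumed_from = pool.latest_checkpoint or 0
        try:
            done = run_trial(pool, slot, total_steps, checkpoint_every,
                             failing_slots, fail_at)
            pool.log.append((slot, resumed_from, "completed"))
            return done
        except GpuFailure:
            pool.log.append((slot, resumed_from, "failed"))
    raise RuntimeError("no healthy slots left in the pool")


pool = ToyPool(slots=["gpu0", "gpu1"])
schedule(pool, total_steps=10, checkpoint_every=3,
         failing_slots={"gpu0"}, fail_at=8)
print(pool.log)
```

In this run the trial checkpoints at steps 3 and 6 on `gpu0`, fails at step 8, and is picked up on `gpu1` starting from step 6. Only the work after the last checkpoint is repeated, and nobody has to restart the experiment by hand.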
asked 16/09/2024
Rudy Raijmakers