List of questions
Question 24 - HPE2-N69 discussion
A trial is running on a GPU slot within a resource pool on HPE Machine Learning Development Environment. That GPU fails. What happens next?
A.
The trial tails, and the ML engineer must restart it manually by re-running the experiment.
B.
The concluded reschedules the trial on another available GPU in the pool, and the trial restarts from the state of the latest training workload.
C.
The conductor reschedules the trial on another available GPU in the pool, and the trial restarts from the latest checkpoint.
D.
The trial fails, and the ML engineer must manually restart it from the latest checkpoint using the WebUI.
Your answer:
0 comments
Sorted by
Leave a comment first