ExamGecko
Question list
Search
Search

List of questions

Search

Related questions











Question 25 - MLS-C01 discussion

Report
Export

A media company is building a computer vision model to analyze images that are on social media. The model consists of CNNs that the company trained by using images that the company stores in Amazon S3. The company used an Amazon SageMaker training job in File mode with a single Amazon EC2 On-Demand Instance.

Every day, the company updates the model by using about 10,000 images that the company has collected in the last 24 hours. The company configures training with only one epoch. The company wants to speed up training and lower costs without the need to make any code changes.

Which solution will meet these requirements?

A.
Instead of File mode, configure the SageMaker training job to use Pipe mode. Ingest the data from a pipe.
Answers
A.
Instead of File mode, configure the SageMaker training job to use Pipe mode. Ingest the data from a pipe.
B.
Instead Of File mode, configure the SageMaker training job to use FastFile mode with no Other changes.
Answers
B.
Instead Of File mode, configure the SageMaker training job to use FastFile mode with no Other changes.
C.
Instead Of On-Demand Instances, configure the SageMaker training job to use Spot Instances. Make no Other changes.
Answers
C.
Instead Of On-Demand Instances, configure the SageMaker training job to use Spot Instances. Make no Other changes.
D.
Instead Of On-Demand Instances, configure the SageMaker training job to use Spot Instances. Implement model checkpoints.
Answers
D.
Instead Of On-Demand Instances, configure the SageMaker training job to use Spot Instances. Implement model checkpoints.
Suggested answer: C

Explanation:

The solution C will meet the requirements because it uses Amazon SageMaker Spot Instances, which are unused EC2 instances that are available at up to 90% discount compared to On-Demand prices. Amazon SageMaker Spot Instances can speed up training and lower costs by taking advantage of the spare EC2 capacity. The company does not need to make any code changes to use Spot Instances, as it can simply enable the managed spot training option in the SageMaker training job configuration.The company also does not need to implement model checkpoints, as it is using only one epoch for training, which means the model will not resume from a previous state1.

The other options are not suitable because:

Option A: Configuring the SageMaker training job to use Pipe mode instead of File mode will not speed up training or lower costs significantly. Pipe mode is a data ingestion mode that streams data directly from S3 to the training algorithm, without copying the data to the local storage of the training instance. Pipe mode can reduce the startup time of the training job and the disk space usage, but it does not affect the computation time or the instance price.Moreover, Pipe mode may require some code changes to handle the streaming data, depending on the training algorithm2.

Option B: Configuring the SageMaker training job to use FastFile mode instead of File mode will not speed up training or lower costs significantly. FastFile mode is a data ingestion mode that copies data from S3 to the local storage of the training instance in parallel with the training process. FastFile mode can reduce the startup time of the training job and the disk space usage, but it does not affect the computation time or the instance price.Moreover, FastFile mode is only available for distributed training jobs that use multiple instances, which is not the case for the company3.

Option D: Configuring the SageMaker training job to use Spot Instances and implementing model checkpoints will not meet the requirements without the need to make any code changes. Model checkpoints are a feature that allows the training job to save the model state periodically to S3, and resume from the latest checkpoint if the training job is interrupted.Model checkpoints can help to avoid losing the training progress and ensure the model convergence, but they require some code changes to implement the checkpointing logic and the resuming logic4.

References:

1: Managed Spot Training - Amazon SageMaker

2: Pipe Mode - Amazon SageMaker

3: FastFile Mode - Amazon SageMaker

4: Checkpoints - Amazon SageMaker

asked 16/09/2024
Phil Horikawa
38 questions
User
Your answer:
0 comments
Sorted by

Leave a comment first