ExamGecko
Question list
Search
Search

List of questions

Search

Related questions











Question 223 - MLS-C01 discussion

Report
Export

A company is creating an application to identify, count, and classify animal images that are uploaded to the company's website. The company is using the Amazon SageMaker image classification algorithm with an ImageNetV2 convolutional neural network (CNN). The solution works well for most animal images but does not recognize many animal species that are less common.

The company obtains 10,000 labeled images of less common animal species and stores the images in Amazon S3. A machine learning (ML) engineer needs to incorporate the images into the model by using Pipe mode in SageMaker.

Which combination of steps should the ML engineer take to train the model? (Choose two.)

A.
Use a ResNet model. Initiate full training mode by initializing the network with random weights.
Answers
A.
Use a ResNet model. Initiate full training mode by initializing the network with random weights.
B.
Use an Inception model that is available with the SageMaker image classification algorithm.
Answers
B.
Use an Inception model that is available with the SageMaker image classification algorithm.
C.
Create a .lst file that contains a list of image files and corresponding class labels. Upload the .lst file to Amazon S3.
Answers
C.
Create a .lst file that contains a list of image files and corresponding class labels. Upload the .lst file to Amazon S3.
D.
Initiate transfer learning. Train the model by using the images of less common species.
Answers
D.
Initiate transfer learning. Train the model by using the images of less common species.
E.
Use an augmented manifest file in JSON Lines format.
Answers
E.
Use an augmented manifest file in JSON Lines format.
Suggested answer: C, D

Explanation:

The combination of steps that the ML engineer should take to train the model are to create a .lst file that contains a list of image files and corresponding class labels, upload the .lst file to Amazon S3, and initiate transfer learning by training the model using the images of less common species. This approach will allow the ML engineer to leverage the existing ImageNetV2 CNN model and fine-tune it with the new data using Pipe mode in SageMaker.

A .lst file is a text file that contains a list of image files and corresponding class labels, separated by tabs. The .lst file format is required for using the SageMaker image classification algorithm with Pipe mode. Pipe mode is a feature of SageMaker that enables streaming data directly from Amazon S3 to the training instances, without downloading the data first. Pipe mode can reduce the startup time, improve the I/O throughput, and enable training on large datasets that exceed the disk size limit.To use Pipe mode, the ML engineer needs to upload the .lst file to Amazon S3 and specify the S3 path as the input data channel for the training job1.

Transfer learning is a technique that enables reusing a pre-trained model for a new task by fine-tuning the model parameters with new data. Transfer learning can save time and computational resources, as well as improve the performance of the model, especially when the new task is similar to the original task. The SageMaker image classification algorithm supports transfer learning by allowing the ML engineer to specify the number of output classes and the number of layers to be retrained.The ML engineer can use the existing ImageNetV2 CNN model, which is trained on 1,000 classes of common objects, and fine-tune it with the new data of less common animal species, which is a similar task2.

The other options are either less effective or not supported by the SageMaker image classification algorithm. Using a ResNet model and initiating full training mode would require training the model from scratch, which would take more time and resources than transfer learning. Using an Inception model is not possible, as the SageMaker image classification algorithm only supports ResNet and ImageNetV2 models.Using an augmented manifest file in JSON Lines format is not compatible with Pipe mode, as Pipe mode only supports .lst files for image classification1.

References:

1: Using Pipe input mode for Amazon SageMaker algorithms | AWS Machine Learning Blog

2: Image Classification Algorithm - Amazon SageMaker

asked 16/09/2024
Matt Smith
47 questions
User
Your answer:
0 comments
Sorted by

Leave a comment first