ExamGecko
Question 570 - SAA-C03 discussion


A research company uses on-premises devices to generate data for analysis. The company wants to use the AWS Cloud to analyze the data. The devices generate .csv files and support writing the data to an SMB file share. Company analysts must be able to use SQL commands to query the data. The analysts will run queries periodically throughout the day.

Which combination of steps will meet these requirements MOST cost-effectively? (Select THREE.)

A.
Deploy an AWS Storage Gateway on premises in Amazon S3 File Gateway mode.
B.
Deploy an AWS Storage Gateway on premises in Amazon FSx File Gateway mode.
C.
Set up an AWS Glue crawler to create a table based on the data that is in Amazon S3.
D.
Set up an Amazon EMR cluster with EMR File System (EMRFS) to query the data that is in Amazon S3. Provide access to analysts.
E.
Set up an Amazon Redshift cluster to query the data that is in Amazon S3. Provide access to analysts.
F.
Set up Amazon Athena to query the data that is in Amazon S3. Provide access to analysts.
Suggested answer: A, C, F

Explanation:

To meet the requirements of the use case in a cost-effective way, the following steps are recommended:

Deploy an AWS Storage Gateway on premises in Amazon S3 File Gateway mode. This will allow the company to write the .csv files generated by the devices to an SMB file share, and the files will be stored as objects in Amazon S3 buckets. AWS Storage Gateway is a hybrid cloud storage service that integrates on-premises environments with AWS storage. Amazon S3 File Gateway mode provides a seamless way to connect to Amazon S3 and access a virtually unlimited amount of cloud storage [1].
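From the device side, the file share is an ordinary directory. The following is a minimal sketch of a writer, assuming the SMB share exported by the gateway is already mounted at a local path. The function name `write_readings`, the mount path, and the column names are hypothetical and do not come from the question.

```python
import csv
from pathlib import Path


def write_readings(share_dir, file_name, header, rows):
    """Write one .csv file into a directory backed by the SMB file share."""
    target = Path(share_dir) / file_name
    # newline="" lets the csv module control line endings itself.
    with target.open("w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(header)
        writer.writerows(rows)
    return target


# Example call (hypothetical mount point and columns):
# write_readings("/mnt/research-share", "device-01.csv",
#                ["device_id", "reading"], [["d1", 20.5], ["d2", 21.0]])
```

With an S3 File Gateway behind the share, each file written this way is uploaded to the bucket as an object, so no AWS SDK is needed on the device.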

Set up an AWS Glue crawler to create a table based on the data that is in Amazon S3. The crawler infers the schema of the .csv files and records it as a table, which is what lets a SQL engine query the files in place. AWS Glue is a serverless data integration service that simplifies data preparation and analysis. AWS Glue crawlers can automatically discover and classify data from various sources and create metadata tables in the AWS Glue Data Catalog [2]. The Data Catalog is a central repository that stores information about data sources and how to access them [3].
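The crawler step could be scripted roughly as follows. This is a sketch, not a complete deployment: the `glue` argument is assumed to behave like `boto3.client("glue")`, and the crawler name, IAM role ARN, database name, and bucket path are placeholders that are not given in the question.

```python
def crawler_request(name, role_arn, database, s3_path):
    """Build keyword arguments for the Glue CreateCrawler API call."""
    return {
        "Name": name,
        "Role": role_arn,
        "DatabaseName": database,
        # One S3 target: the prefix where the gateway stores the .csv objects.
        "Targets": {"S3Targets": [{"Path": s3_path}]},
    }


def create_and_start_crawler(glue, request):
    """Create the crawler, then start a single run.

    `glue` is assumed to expose create_crawler and start_crawler,
    as a boto3 Glue client does.
    """
    glue.create_crawler(**request)
    glue.start_crawler(Name=request["Name"])
    return request["Name"]


# Example (placeholder values, requires boto3 and AWS credentials):
# import boto3
# req = crawler_request("device-csv-crawler",
#                       "arn:aws:iam::123456789012:role/ExampleGlueRole",
#                       "research_db", "s3://example-bucket/devices/")
# create_and_start_crawler(boto3.client("glue"), req)
```

Passing the client in as a parameter keeps the request-building logic testable without AWS access.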

Set up Amazon Athena to query the data that is in Amazon S3. Athena is a serverless, interactive query service that analyzes data directly in Amazon S3 using standard SQL. It is integrated with the AWS Glue Data Catalog, so analysts can query the tables that the crawler defines. Athena has no cluster to provision and charges only for the queries that are run, which makes it a cost-effective option for periodic queries [4].
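An analyst's periodic query could be run programmatically along these lines. This is a hedged sketch: `athena` is assumed to behave like `boto3.client("athena")`, and the database name, table name, and results location in the usage comment are placeholders. It fetches only the first page of results.

```python
import time


def run_athena_query(athena, sql, database, output_location, poll_seconds=1.0):
    """Start a query, wait for it to finish, and return the first page of rows."""
    started = athena.start_query_execution(
        QueryString=sql,
        QueryExecutionContext={"Database": database},
        ResultConfiguration={"OutputLocation": output_location},
    )
    query_id = started["QueryExecutionId"]

    # Poll until Athena reports a terminal state.
    while True:
        status = athena.get_query_execution(QueryExecutionId=query_id)
        state = status["QueryExecution"]["Status"]["State"]
        if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
            break
        time.sleep(poll_seconds)

    if state != "SUCCEEDED":
        raise RuntimeError(f"Athena query {query_id} ended in state {state}")

    page = athena.get_query_results(QueryExecutionId=query_id)
    # Each row is a list of cells; a cell may lack VarCharValue when it is null.
    return [
        [cell.get("VarCharValue") for cell in row["Data"]]
        for row in page["ResultSet"]["Rows"]
    ]


# Example (placeholder names, requires boto3 and AWS credentials):
# import boto3
# rows = run_athena_query(boto3.client("athena"),
#                         "SELECT device_id, reading FROM devices LIMIT 10",
#                         "research_db", "s3://example-bucket/athena-results/")
```

For a SELECT statement, the first returned row holds the column names.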

The other options are not correct because they are either not cost-effective or not suitable for the use case. Deploying an AWS Storage Gateway on premises in Amazon FSx File Gateway mode is not correct because this mode provides low-latency access to fully managed Windows file shares in Amazon FSx for Windows File Server. The data would not be stored in Amazon S3, where the crawler and the query service expect it. Setting up an Amazon EMR cluster with EMR File System (EMRFS) to query the data that is in Amazon S3 is not correct because this option involves setting up and managing a cluster of EC2 instances, which adds complexity and cost to the solution. Setting up an Amazon Redshift cluster to query the data that is in Amazon S3 is not correct because this option also involves provisioning and managing a cluster of nodes, which adds overhead and cost for queries that run only periodically.
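The cost argument can be made concrete with simple arithmetic. The sketch below compares a pay-per-query model with an always-on cluster billed by the hour. Every number in the usage comment is a placeholder chosen for illustration, not an AWS price, and the function names are hypothetical.

```python
def per_query_daily_cost(queries_per_day, tb_scanned_per_query, price_per_tb):
    """Daily cost when billing depends only on the data scanned by queries."""
    return queries_per_day * tb_scanned_per_query * price_per_tb


def cluster_daily_cost(hourly_rate, hours_running=24):
    """Daily cost of a cluster that is billed for every hour it runs."""
    return hourly_rate * hours_running


def cheaper_option(per_query_cost, cluster_cost):
    """Name the cheaper of the two daily costs."""
    return "per-query" if per_query_cost <= cluster_cost else "cluster"


# Example with placeholder inputs (not real prices):
# on_demand = per_query_daily_cost(queries_per_day=20,
#                                  tb_scanned_per_query=0.01,
#                                  price_per_tb=5.0)
# always_on = cluster_daily_cost(hourly_rate=0.25)
# print(cheaper_option(on_demand, always_on))
```

The per-query cost scales with how often analysts query and how much data each query scans, while the cluster cost accrues even when no queries run. That difference is why a cluster is harder to justify for periodic use.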

References:

[1] What is AWS Storage Gateway?
[2] What is AWS Glue?
[3] AWS Glue Data Catalog
[4] What is Amazon Athena?

asked 16/09/2024
Stefano Humphries