An airline has been collecting metrics on flight activities for analytics. A recently completed proof of concept demonstrates how the company provides insights to data analysts to improve on-time departures. The proof of concept used objects in Amazon S3, which contained the metrics in .csv format, and used Amazon Athena for querying the data. As the amount of data increases, the data analyst wants to optimize the storage solution to improve query performance.
Which options should the data analyst use to improve performance as the data lake grows? (Choose three.)

Question

An airline has been collecting metrics on flight activities for analytics. A recently completed proof of concept demonstrates how the company provides insights to data analysts to improve on-time departures. The proof of concept used objects in Amazon S3, which contained the metrics in .csv format, and used Amazon Athena for querying the data. As the amount of data increases, the data analyst wants to optimize the storage solution to improve query performance.

Which options should the data analyst use to improve performance as the data lake grows? (Choose three.)

Oeurn Chan · Accepted Answer

Add a randomized string to the beginning of the keys in S3 to get more throughput across partitions.

Oeurn Chan · Accepted Answer

Compress the objects to reduce the data transfer I/O.

Oeurn Chan · Accepted Answer

Preprocess the .csv data to JSON to reduce I/O by fetching only the document keys needed by the query.

Oeurn Chan · Answer

Use an S3 bucket in the same account as Athena.

Oeurn Chan · Answer

Use an S3 bucket in the same Region as Athena.

Oeurn Chan · Answer

Preprocess the .csv data to Apache Parquet to reduce I/O by fetching only the data blocks needed for predicates.

Question list

List of questions

Question 1

(0)

Question 2

(0)

Question 3

(0)

Question 4

(0)

Question 5

(0)

Question 6

(0)

Question 7

(0)

Question 8

(0)

Question 9

(0)

Question 10

(0)

Related questions

Question 64 - DAS-C01 discussion

Suggested answer: A, C, E

0 comments