Question 48 - DEA-C01 discussion
A data engineer runs Amazon Athena queries on data that is in an Amazon S3 bucket. The Athena queries use AWS Glue Data Catalog as a metadata table.
The data engineer notices that the Athena query plans are experiencing a performance bottleneck. The data engineer determines that the cause of the performance bottleneck is the large number of partitions that are in the S3 bucket. The data engineer must resolve the performance bottleneck and reduce Athena query planning time.
Which solutions will meet these requirements? (Choose two.)
Create an AWS Glue partition index. Enable partition filtering.
Bucket the data based on a column that user queries commonly filter on in a WHERE clause.
Use Athena partition projection based on the S3 bucket prefix.
Transform the data that is in the S3 bucket to Apache Parquet format.
Use the Amazon EMR S3DistCP utility to combine smaller objects in the S3 bucket into larger objects.
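For readers unfamiliar with what the partition projection option involves, here is a minimal sketch of the table properties Athena reads when projection is enabled. The table name `events`, the partition column `dt`, the bucket `example-bucket`, and the helper `projection_properties` are all hypothetical and chosen only for illustration; the property keys (`projection.enabled`, `projection.<column>.type`, `projection.<column>.range`, `projection.<column>.format`, `storage.location.template`) are the ones Athena documents for a date-typed partition column.

```python
def projection_properties(table, column, date_range, location_template):
    """Build an ALTER TABLE statement that enables Athena partition
    projection for one date-typed partition column (illustrative helper)."""
    props = {
        # Tell Athena to compute partitions instead of listing them in the catalog.
        "projection.enabled": "true",
        f"projection.{column}.type": "date",
        f"projection.{column}.range": date_range,
        f"projection.{column}.format": "yyyy/MM/dd",
        # Maps each projected partition value onto its S3 prefix.
        "storage.location.template": location_template,
    }
    body = ",\n  ".join(f"'{key}' = '{value}'" for key, value in props.items())
    return f"ALTER TABLE {table} SET TBLPROPERTIES (\n  {body}\n)"


# Hypothetical table, column, and bucket names.
ddl = projection_properties(
    table="events",
    column="dt",
    date_range="2023/01/01,NOW",
    location_template="s3://example-bucket/events/${dt}/",
)
print(ddl)
```

With these properties set, Athena derives partition locations from the template at planning time rather than fetching partition metadata from the Glue Data Catalog, which is why the technique targets planning time specifically.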