ExamGecko
Question list
Search
Search

List of questions

Search

Related questions











Question 96 - DEA-C01 discussion

Report
Export

A company has a data lake in Amazon S3. The company collects AWS CloudTrail logs for multiple applications. The company stores the logs in the data lake, catalogs the logs in AWS Glue, and partitions the logs based on the year. The company uses Amazon Athena to analyze the logs.

Recently, customers reported that a query on one of the Athena tables did not return any data. A data engineer must resolve the issue.

Which combination of troubleshooting steps should the data engineer take? (Select TWO.)

A.

Confirm that Athena is pointing to the correct Amazon S3 location.

Answers
A.

Confirm that Athena is pointing to the correct Amazon S3 location.

B.

Increase the query timeout duration.

Answers
B.

Increase the query timeout duration.

C.

Use the MSCK REPAIR TABLE command.

Answers
C.

Use the MSCK REPAIR TABLE command.

D.

Restart Athena.

Answers
D.

Restart Athena.

E.

Delete and recreate the problematic Athena table.

Answers
E.

Delete and recreate the problematic Athena table.

Suggested answer: A, C

Explanation:

The problem likely arises from Athena not being able to read from the correct S3 location or missing partitions. The two most relevant troubleshooting steps involve checking the S3 location and repairing the table metadata.

A . Confirm that Athena is pointing to the correct Amazon S3 location:

One of the most common issues with missing data in Athena queries is that the query is pointed to an incorrect or outdated S3 location. Checking the S3 path ensures Athena is querying the correct data.

C . Use the MSCK REPAIR TABLE command:

When new partitions are added to the S3 bucket without being reflected in the Glue Data Catalog, Athena queries will not return data from those partitions. The MSCK REPAIR TABLE command updates the Glue Data Catalog with the latest partitions.

Alternatives Considered:

B (Increase query timeout): Timeout issues are unrelated to missing data.

D (Restart Athena): Athena does not require restarting.

E (Delete and recreate table): This introduces unnecessary overhead when the issue can be resolved by repairing the table and confirming the S3 location.

Athena Query Fails to Return Data

asked 29/10/2024
Pieter Louw
44 questions
User
Your answer:
0 comments
Sorted by

Leave a comment first