A data engineer is building a data pipeline on AWS by using AWS Glue extract, transform, and load (ETL) jobs. The data engineer needs to process data from Amazon RDS and MongoDB, perform transformations, and load the transformed data into Amazon Redshift for analytics. The data updates must occur every hour.
Which combination of tasks will meet these requirements with the LEAST operational overhead? (Choose two.)

Question

A data engineer is building a data pipeline on AWS by using AWS Glue extract, transform, and load (ETL) jobs. The data engineer needs to process data from Amazon RDS and MongoDB, perform transformations, and load the transformed data into Amazon Redshift for analytics. The data updates must occur every hour.

Which combination of tasks will meet these requirements with the LEAST operational overhead? (Choose two.)

Minoel Prendi · Accepted Answer

Configure AWS Glue triggers to run the ETL jobs even/ hour.

Minoel Prendi · Accepted Answer

Use AWS Glue connections to establish connectivity between the data sources and Amazon Redshift.

Minoel Prendi · Answer

Use AWS Glue DataBrewto clean and prepare the data for analytics.

Minoel Prendi · Answer

Use AWS Lambda functions to schedule and run the ETL jobs even/ hour.

Minoel Prendi · Answer

Use the Redshift Data API to load transformed data into Amazon Redshift.

Question list

List of questions

Question 1

(0)

Question 2

(0)

Question 3

(0)

Question 4

(0)

Question 5

(0)

Question 6

(0)

Question 7

(0)

Question 8

(0)

Question 9

(0)

Question 10

(0)

Related questions

Question 29 - DEA-C01 discussion

Suggested answer: A, D

0 comments