A company stores CSV files in an Amazon S3 bucket. A data engineer needs to process the data in the CSV files and store the processed data in a new S3 bucket.
The process needs to rename a column, remove specific columns, ignore the second row of each file, create a new column based on the values of the first row of the data, and filter the results by a numeric value of a column.
Which solution will meet these requirements with the LEAST development effort?

Question

A company stores CSV files in an Amazon S3 bucket. A data engineer needs to process the data in the CSV files and store the processed data in a new S3 bucket.

The process needs to rename a column, remove specific columns, ignore the second row of each file, create a new column based on the values of the first row of the data, and filter the results by a numeric value of a column.

Which solution will meet these requirements with the LEAST development effort?

Nathan Phelan · Accepted Answer

Use AWS Glue DataBrew recipes to read and transform the CSV files.

Nathan Phelan · Answer

Use AWS Glue Python jobs to read and transform the CSV files.

Nathan Phelan · Answer

Use an AWS Glue custom crawler to read and transform the CSV files.

Nathan Phelan · Answer

Use an AWS Glue workflow to build a set of jobs to crawl and transform the CSV files.

Question list

List of questions

Question 1

(0)

Question 2

(0)

Question 3

(0)

Question 4

(0)

Question 5

(0)

Question 6

(0)

Question 7

(0)

Question 8

(0)

Question 9

(0)

Question 10

(0)

Related questions

Question 107 - DEA-C01 discussion

Suggested answer: D

0 comments