Question 7 - BDS-C00 discussion


A large grocery distributor receives daily depletion reports from the field in the form of gzip archives of CSV files uploaded to Amazon S3. The files range from 500 MB to 5 GB. These files are processed daily by an EMR job.

Recently it has been observed that the file sizes vary, and the EMR jobs take too long. The distributor needs to tune and optimize the data processing workflow with this limited information to improve the performance of the EMR job.

Which recommendation should an administrator provide?

A.
Reduce the HDFS block size to increase the number of task processors.
B.
Use bzip2 or Snappy rather than gzip for the archives.
C.
Decompress the gzip archives and store the data as CSV files.
D.
Use Avro rather than gzip for the archives.
Suggested answer: B
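
One way to read the suggested answer: a gzip file cannot be split across Hadoop input splits, so each archive is decompressed by a single task regardless of its size, whereas bzip2 is a splittable codec. The following is a minimal sketch, using only the Python standard library, of recompressing one gzip archive to bzip2 in a streaming fashion. The function name `recompress_gzip_to_bzip2` and the file paths are hypothetical, and the sketch assumes the archive has already been copied from S3 to local disk; it is not taken from the question or from any EMR tooling.

```python
import bz2
import gzip
import shutil


def recompress_gzip_to_bzip2(src_path, dst_path, chunk_size=1024 * 1024):
    """Stream a gzip file into a bzip2 file without loading it all into memory.

    Hypothetical helper for illustration: src_path and dst_path are local paths.
    """
    # gzip.open and bz2.open both return file-like objects in binary mode,
    # so the data can be copied chunk by chunk.
    with gzip.open(src_path, "rb") as src, bz2.open(dst_path, "wb") as dst:
        shutil.copyfileobj(src, dst, chunk_size)
```

A call such as `recompress_gzip_to_bzip2("report.csv.gz", "report.csv.bz2")` would leave the CSV content unchanged and only swap the compression codec. Whether this is worth the extra CPU cost of bzip2 depends on the cluster and the job, which the question does not specify.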
asked 16/09/2024
Bhavani Simhadri