A company is planning to use a provisioned Amazon EMR cluster that runs Apache Spark jobs to perform big data analysis. The company requires high reliability. A big data team must follow best practices for running cost-optimized and long-running workloads on Amazon EMR. The team must find a solution that will maintain the company's current level of performance.
Which combination of resources will meet these requirements MOST cost-effectively? (Choose two.)

Question

A company is planning to use a provisioned Amazon EMR cluster that runs Apache Spark jobs to perform big data analysis. The company requires high reliability. A big data team must follow best practices for running cost-optimized and long-running workloads on Amazon EMR. The team must find a solution that will maintain the company's current level of performance.

Which combination of resources will meet these requirements MOST cost-effectively? (Choose two.)

Ehsan Ali · Accepted Answer

Use Amazon S3 as a persistent data store.

Ehsan Ali · Accepted Answer

Use Graviton instances for core nodes and task nodes.

Ehsan Ali · Answer

Use Hadoop Distributed File System (HDFS) as a persistent data store.

Ehsan Ali · Answer

Use x86-based instances for core nodes and task nodes.

Ehsan Ali · Answer

Use Spot Instances for all primary nodes.

Question list

List of questions

Question 1

(0)

Question 2

(0)

Question 3

(0)

Question 4

(0)

Question 5

(0)

Question 6

(0)

Question 7

(0)

Question 8

(0)

Question 9

(0)

Question 10

(0)

Related questions

Question 38 - DEA-C01 discussion

Suggested answer: B, D

0 comments