A company uses an Amazon Redshift provisioned cluster as its database. The Redshift cluster has five reserved ra3.4xlarge nodes and uses key distribution.
A data engineer notices that one of the nodes frequently has a CPU load over 90%. SQL queries that run on the node are queued. The other four nodes usually have a CPU load under 15% during daily operations.
The data engineer wants to maintain the current number of compute nodes. The data engineer also wants to balance the load more evenly across all five compute nodes.
Which solution will meet these requirements?

Jevgenij Žarikov · Accepted Answer

Change the distribution key to the table column that has the largest dimension.
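
To illustrate why the choice of distribution key matters here, the following is a minimal sketch in Python that simulates hash-based placement of rows across five nodes. It is not Redshift's actual hashing algorithm; the MD5-based `node_for` function, the `region-N` key values, and the 90% hot-key ratio are all assumptions made up for the illustration. It only shows the general effect: a key dominated by one value piles rows onto a single node, while a high-cardinality key spreads them out.

```python
import hashlib
from collections import Counter

NUM_NODES = 5  # matches the five-node cluster in the question


def node_for(value, num_nodes=NUM_NODES):
    """Map a key value to a node with a stable hash (a stand-in, not Redshift's own hash)."""
    digest = hashlib.md5(str(value).encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_nodes


def rows_per_node(key_values, num_nodes=NUM_NODES):
    """Count how many rows land on each node for a given column of key values."""
    counts = Counter(node_for(v, num_nodes) for v in key_values)
    return [counts.get(n, 0) for n in range(num_nodes)]


def skew_ratio(per_node):
    """Ratio of the fullest node to the emptiest node (infinite if a node is empty)."""
    smallest = min(per_node)
    return float("inf") if smallest == 0 else max(per_node) / smallest


# Hypothetical skewed key: 9,000 of 10,000 rows share a single value.
skewed_keys = ["region-1"] * 9000 + [f"region-{i}" for i in range(2, 1002)]

# Hypothetical high-cardinality key: 10,000 distinct values.
even_keys = range(10000)

skewed = rows_per_node(skewed_keys)
even = rows_per_node(even_keys)

print("skewed key rows per node:", skewed)
print("even key rows per node:  ", even)
print("skew ratio (skewed key):", skew_ratio(skewed))
print("skew ratio (even key):  ", skew_ratio(even))
```

With the skewed key, one node holds at least 9,000 of the 10,000 rows, which mirrors the one-hot-node symptom in the question. On a real cluster, row skew per table can be inspected through the `skew_rows` column of `SVV_TABLE_INFO`, and the key can be changed with `ALTER TABLE ... ALTER DISTKEY`.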

Jevgenij Žarikov · Answer

Change the sort key to be the data column that is most often used in a WHERE clause of the SQL SELECT statement.

Jevgenij Žarikov · Answer

Upgrade the reserved node from ra3.4xlarge to ra3.16xlarge.

Jevgenij Žarikov · Answer

Change the primary key to be the data column that is most often used in a WHERE clause of the SQL SELECT statement.

Related questions

Question 67 - DEA-C01 discussion

Suggested answer: B
