ExamGecko
Question 332 - Professional Data Engineer discussion


You have a data pipeline with a Dataflow job that aggregates and writes time series metrics to Bigtable. You notice that data is slow to update in Bigtable. This data feeds a dashboard used by thousands of users across the organization. You need to support additional concurrent users and reduce the amount of time required to write the data. What should you do?

Choose 2 answers

A. Configure your Dataflow pipeline to use local execution.

B. Modify your Dataflow pipeline to use the Flatten transform before writing to Bigtable.

C. Modify your Dataflow pipeline to use the CoGroupByKey transform before writing to Bigtable.

D. Increase the maximum number of Dataflow workers by setting maxNumWorkers in PipelineOptions.

E. Increase the number of nodes in the Bigtable cluster.
Suggested answer: D, E

Explanation:

Raising maxNumWorkers (D) lifts the ceiling on Dataflow autoscaling, so more workers can write to Bigtable in parallel. Adding nodes to the Bigtable cluster (E) increases write throughput, which scales roughly linearly with node count. Local execution (A) would reduce capacity rather than increase it, and the Flatten (B) and CoGroupByKey (C) transforms reshape the data in the pipeline; they do not speed up writes.

https://cloud.google.com/bigtable/docs/performance#performance-write-throughput
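A minimal sketch of answer E, scaling the Bigtable cluster with the documented `gcloud bigtable clusters update --num-nodes` flag. The cluster and instance names are placeholders; the command is echoed rather than executed so the sketch runs without a GCP project.

```shell
# Sketch (hedged): add Bigtable nodes for more write throughput (answer E).
# "my-cluster" and "my-instance" are placeholder names.
CMD="gcloud bigtable clusters update my-cluster --instance=my-instance --num-nodes=6"
echo "$CMD"   # echoed instead of run; remove the echo to apply for real
```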

https://cloud.google.com/dataflow/docs/reference/pipeline-options
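A minimal sketch of answer D: building the command-line options for a Beam Python pipeline with a higher autoscaling ceiling. The flag names (`--max_num_workers`, `--autoscaling_algorithm`) are the documented Dataflow service options; the project ID and region values are placeholders, and the helper function name is ours.

```python
def dataflow_options(max_workers: int) -> list:
    """Assemble Dataflow pipeline options that raise the autoscaling
    ceiling (maxNumWorkers, answer D). Project/region are placeholders."""
    return [
        "--runner=DataflowRunner",
        "--project=my-project",        # placeholder project ID
        "--region=us-central1",        # placeholder region
        f"--max_num_workers={max_workers}",
        "--autoscaling_algorithm=THROUGHPUT_BASED",
    ]

# These options would be passed to beam.Pipeline(options=PipelineOptions(...)).
opts = dataflow_options(100)
```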

asked 18/09/2024
Channa Leang