Question 246 - Professional Data Engineer discussion
You are testing a Dataflow pipeline to ingest and transform text files. The files are gzip-compressed, errors are written to a dead-letter queue, and you are using SideInputs to join data. You notice that the pipeline is taking longer to complete than expected. What should you do to expedite the Dataflow job?
A. Switch to compressed Avro files
B. Reduce the batch size
C. Retry records that throw an error
D. Use CoGroupByKey instead of the SideInput