Question 143 - Professional Data Engineer discussion
Your company receives both batch- and stream-based event data. You want to process the data using Google Cloud Dataflow over a predictable time period.
However, you realize that in some instances data can arrive late or out of order. How should you design your Cloud Dataflow pipeline to handle data that is late or out of order?
A. Set a single global window to capture all the data.
B. Set sliding windows to capture all the lagged data.
C. Use watermarks and timestamps to capture the lagged data.
D. Ensure every datasource type (stream or batch) has a timestamp, and use the timestamps to define the logic for lagged data.
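
For readers unfamiliar with the terms used in the options, here is a minimal plain-Python sketch of how event timestamps, a watermark, and an allowed-lateness bound interact when events arrive out of order. It does not use the Dataflow or Apache Beam API. The names WINDOW_SIZE, ALLOWED_LATENESS, window_start, and process are hypothetical, and the watermark is a toy heuristic (the largest event time seen so far), which is an assumption made for illustration and not how Dataflow estimates its watermark.

```python
from collections import defaultdict

WINDOW_SIZE = 60        # seconds per fixed event-time window (assumed value)
ALLOWED_LATENESS = 30   # seconds a window accepts data past its end (assumed value)


def window_start(event_time):
    """Return the start of the fixed window that contains event_time."""
    return event_time - (event_time % WINDOW_SIZE)


def process(events):
    """Group events into fixed event-time windows, dropping data that is too late.

    events: iterable of (event_time, value) pairs in arrival order.
    Toy watermark: the largest event time observed so far.
    """
    windows = defaultdict(list)
    dropped = []
    watermark = 0
    for event_time, value in events:
        start = window_start(event_time)
        window_end = start + WINDOW_SIZE
        if watermark > window_end + ALLOWED_LATENESS:
            # The watermark has passed the window end plus the lateness bound.
            dropped.append((event_time, value))
        else:
            # On time, or late but still within the allowed lateness.
            windows[start].append(value)
        watermark = max(watermark, event_time)
    return dict(windows), dropped


# Arrival order differs from event-time order: "c" and "e" arrive late.
events = [(5, "a"), (70, "b"), (10, "c"), (200, "d"), (20, "e")]
windows, dropped = process(events)
print(windows)  # {0: ['a', 'c'], 60: ['b'], 180: ['d']}
print(dropped)  # [(20, 'e')]
```

In this sketch, "c" arrives out of order but is still placed in its window because the watermark has not yet passed the lateness bound, while "e" arrives after that bound and is dropped.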