ExamGecko

Question 94 - Professional Data Engineer discussion

Which Cloud Dataflow / Beam feature should you use to aggregate data in an unbounded data source every hour based on the time when the data entered the pipeline?

A. An hourly watermark
B. An event time trigger
C. The withAllowedLateness method
D. A processing time trigger
Suggested answer: D

Explanation:

When collecting and grouping data into windows, Beam uses triggers to determine when to emit the aggregated results of each window.

Processing time triggers. These triggers operate on the processing time, that is, the time when the data element is processed at any given stage in the pipeline.

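Event time triggers. These triggers operate on the event time, as indicated by the timestamp on each data element. Beam's default trigger is event time-based.

The question asks for results based on when the data entered the pipeline, not when it was generated, so a processing time trigger (D) is the feature that fits.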
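
The difference between the two notions of time can be sketched without Beam at all. The snippet below is a toy simulation in plain Python, not Beam API code: the record fields (`event_time`, `arrival_time`, `value`) and the helper `hourly_sums` are hypothetical names made up for illustration. It sums the same records into hourly buckets twice, once by the timestamp carried on the data and once by the time the data reached the pipeline.

```python
from collections import defaultdict

HOUR = 3600  # seconds in one hour


def hourly_sums(records, time_field):
    """Sum values into one-hour buckets keyed by the chosen timestamp field."""
    buckets = defaultdict(int)
    for rec in records:
        # Round the timestamp down to the start of its hour.
        bucket_start = rec[time_field] // HOUR * HOUR
        buckets[bucket_start] += rec["value"]
    return dict(buckets)


# Hypothetical records (times in seconds):
# event_time   = when the data was generated
# arrival_time = when the data entered the pipeline
records = [
    {"event_time": 100, "arrival_time": 200, "value": 1},
    # Generated in hour 0 but arrived in hour 1.
    {"event_time": 3500, "arrival_time": 3700, "value": 2},
    {"event_time": 3800, "arrival_time": 3900, "value": 4},
]

by_event = hourly_sums(records, "event_time")
by_arrival = hourly_sums(records, "arrival_time")

print(by_event)    # {0: 3, 3600: 4}
print(by_arrival)  # {0: 1, 3600: 6}
```

The second record lands in a different hour depending on which clock is used, which is the distinction the question turns on. In Beam's Python SDK, the building blocks for the arrival-based behavior are `AfterProcessingTime` and `Repeatedly` in `apache_beam.transforms.trigger`; how they are wired into a given pipeline depends on its windowing setup.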

Reference: https://beam.apache.org/documentation/programming-guide/#triggers

asked 18/09/2024
Mia Mattsson