You are designing a real-time system for a ride hailing app that identifies areas with high demand for rides to effectively reroute available drivers to meet the demand. The system ingests data from multiple sources to Pub/Sub. processes the data, and stores the results for visualization and analysis in real-time dashboards. The data sources include driver location updates every 5 seconds and app-based booking events from riders. The data processing involves real-time aggregation of supply and demand data for the last 30 seconds, every 2 seconds, and storing the results in a low-latency system for visualization. What should you do?

Question

Angel Molina · Accepted Answer

Group the data by using a hopping window in a Dataflow pipeline, and write the aggregated data to Memorystore

Angel Molina · Answer

Group the data by using a tumbling window in a Dataflow pipeline, and write the aggregated data to Memorystore

Angel Molina · Answer

Group the data by using a session window in a Dataflow pipeline, and write the aggregated data to BigQuery.

Angel Molina · Answer

Group the data by using a hopping window in a Dataflow pipeline, and write the aggregated data to BigQuery.

Question list

List of questions

Question 1

(0)

Question 2

(0)

Question 3

(0)

Question 4

(0)

Question 5

(0)

Question 6

(0)

Question 7

(0)

Question 8

(0)

Question 9

(0)

Question 10

(0)

Related questions

Question 284 - Professional Data Engineer discussion

Suggested answer: B

0 comments