ExamGecko
Question list
Search
Search

List of questions

Search

Related questions











Question 186 - Professional Data Engineer discussion

Report
Export

You are creating a new pipeline in Google Cloud to stream IoT data from Cloud Pub/Sub through Cloud Dataflow to BigQuery. While previewing the data, you notice that roughly 2% of the data appears to be corrupt. You need to modify the Cloud Dataflow pipeline to filter out this corrupt dat a. What should you do?

A.
Add a SideInput that returns a Boolean if the element is corrupt.
Answers
A.
Add a SideInput that returns a Boolean if the element is corrupt.
B.
Add a ParDo transform in Cloud Dataflow to discard corrupt elements.
Answers
B.
Add a ParDo transform in Cloud Dataflow to discard corrupt elements.
C.
Add a Partition transform in Cloud Dataflow to separate valid data from corrupt data.
Answers
C.
Add a Partition transform in Cloud Dataflow to separate valid data from corrupt data.
D.
Add a GroupByKey transform in Cloud Dataflow to group all of the valid data together and discard the rest.
Answers
D.
Add a GroupByKey transform in Cloud Dataflow to group all of the valid data together and discard the rest.
Suggested answer: B
asked 18/09/2024
Abdul Majid Pasha
39 questions
User
Your answer:
0 comments
Sorted by

Leave a comment first