ExamGecko
Question list
Search
Search

List of questions

Search

Related questions











Question 312 - Professional Data Engineer discussion

Report
Export

You are part of a healthcare organization where data is organized and managed by respective data owners in various storage services. As a result of this decentralized ecosystem, discovering and managing data has become difficult You need to quickly identify and implement a cost-optimized solution to assist your organization with the following

* Data management and discovery

* Data lineage tracking

* Data quality validation

How should you build the solution?

A.
Use BigLake to convert the current solution into a data lake architecture.
Answers
A.
Use BigLake to convert the current solution into a data lake architecture.
B.
Build a new data discovery tool on Google Kubernetes Engine that helps with new source onboarding and data lineage tracking.
Answers
B.
Build a new data discovery tool on Google Kubernetes Engine that helps with new source onboarding and data lineage tracking.
C.
Use BigOuery to track data lineage, and use Dataprep to manage data and perform data quality validation.
Answers
C.
Use BigOuery to track data lineage, and use Dataprep to manage data and perform data quality validation.
D.
Use Dataplex to manage data, track data lineage, and perform data quality validation.
Answers
D.
Use Dataplex to manage data, track data lineage, and perform data quality validation.
Suggested answer: D

Explanation:

Dataplex is a Google Cloud service that provides a unified data fabric for data lakes and data warehouses. It enables data governance, management, and discovery across multiple data domains, zones, and assets. Dataplex also supports data lineage tracking, which shows the origin and transformation of data over time. Dataplex also integrates with Dataprep, a data preparation and quality tool that allows users to clean, enrich, and transform data using a visual interface. Dataprep can also monitor data quality and detect anomalies using machine learning. Therefore, Dataplex is the most suitable solution for the given scenario, as it meets all the requirements of data management and discovery, data lineage tracking, and data quality validation.Reference:

Dataplex overview

Automate data governance, extend your data fabric with Dataplex-BigLake integration

Dataprep documentation

asked 18/09/2024
Siraj Moosa
33 questions
User
Your answer:
0 comments
Sorted by

Leave a comment first