ExamGecko
Question list
Search
Search

List of questions

Search

Related questions











Question 21 - Professional Data Engineer discussion

Report
Export

Your company uses a proprietary system to send inventory data every 6 hours to a data ingestion service in the cloud. Transmitted data includes a payload of several fields and the timestamp of the transmission. If there are any concerns about a transmission, the system re-transmits the dat a. How should you deduplicate the data most efficiency?

A.
Assign global unique identifiers (GUID) to each data entry.
Answers
A.
Assign global unique identifiers (GUID) to each data entry.
B.
Compute the hash value of each data entry, and compare it with all historical data.
Answers
B.
Compute the hash value of each data entry, and compare it with all historical data.
C.
Store each data entry as the primary key in a separate database and apply an index.
Answers
C.
Store each data entry as the primary key in a separate database and apply an index.
D.
Maintain a database table to store the hash value and other metadata for each data entry.
Answers
D.
Maintain a database table to store the hash value and other metadata for each data entry.
Suggested answer: D
asked 18/09/2024
Naing Thet
41 questions
User
Your answer:
0 comments
Sorted by

Leave a comment first