List of questions
Related questions
Question 74 - CBDA discussion
A data scientist at a consumer goods company, has been asked to do a detailed analysis on customer profiles. The Data Scientist has identified an external data source that carries valuable additional information on their customers. The data scientist also identifies the address column as the most reliable column to join the internal data source with the external data source. Addresses may appear in different formats for example:
File A = '13 Smith St'
File B = 'Unit 7, 13 Smith Street'
Which of the following techniques would be useful in this situation?
A.
Deterministic linkage
B.
Probabilistic linkage
C.
Genetic linkage
D.
Cuff linkage
Your answer:
0 comments
Sorted by
Leave a comment first