ExamGecko
Question list
Search
Search

List of questions

Search

Related questions











Question 69 - Certified Data Cloud Consultant discussion

Report
Export

Every day, Northern Trail Outfitters uploads a summary of the last 24 hours of store transactions to a new file in an Amazon S3 bucket, and files older than seven days are automatically deleted. Each file contains a timestamp in a standardized naming convention.

Which two options should a consultant configure when ingesting this data stream?

Choose 2 answers

A.
Ensure that deletion of old files is enabled.
Answers
A.
Ensure that deletion of old files is enabled.
B.
Ensure the refresh mode is set to 'Upsert'.
Answers
B.
Ensure the refresh mode is set to 'Upsert'.
C.
Ensure the filename contains a wildcard to a accommodate the timestamp.
Answers
C.
Ensure the filename contains a wildcard to a accommodate the timestamp.
D.
Ensure the refresh mode is set to 'Full Refresh.''
Answers
D.
Ensure the refresh mode is set to 'Full Refresh.''
Suggested answer: B, C

Explanation:

: When ingesting data from an Amazon S3 bucket, the consultant should configure the following options:

The refresh mode should be set to ''Upsert'', which means that new and updated records will be added or updated in Data Cloud, while existing records will be preserved. This ensures that the data is always up to date and consistent with the source.

The filename should contain a wildcard to accommodate the timestamp, which means that the file name pattern should include a variable part that matches the timestamp format. For example, if the file name isstore_transactions_2023-12-18.csv, the wildcard could bestore_transactions_*.csv. This ensures that the ingestion process can identify and process the correct file every day.

The other options are not necessary or relevant for this scenario:

Deletion of old files is a feature of the Amazon S3 bucket, not the Data Cloud ingestion process. Data Cloud does not delete any files from the source, nor does it require the source files to be deleted after ingestion.

Full Refresh is a refresh mode that deletes all existing records in Data Cloud and replaces them with the records from the source file. This is not suitable for this scenario, as it would result in data loss and inconsistency, especially if the source file only contains the summary of the last 24 hours of transactions.

asked 23/09/2024
Maria Lilian Tongson
41 questions
User
Your answer:
0 comments
Sorted by

Leave a comment first