ExamGecko
Question list
Search
Search

List of questions

Search

Question 68 - DEA-C01 discussion

Report
Export

As a Data Engineer, you have requirement to query most recent data from the Large Dataset that reside in the external cloud storage, how would you design your data pipelines keeping in mind fastest time to delivery?

A.
Data pipelines would be created to first load data into internal stages & then into Per-manent table with SCD Type 2 transformation.
Answers
A.
Data pipelines would be created to first load data into internal stages & then into Per-manent table with SCD Type 2 transformation.
B.
Direct Querying External tables on top of existing data stored in external cloud storage for analysis without first loading it into Snowflake.
Answers
B.
Direct Querying External tables on top of existing data stored in external cloud storage for analysis without first loading it into Snowflake.
C.
Unload data into SnowFlake Internal data storage using PUT command.
Answers
C.
Unload data into SnowFlake Internal data storage using PUT command.
D.
Snowpipe can be leveraged with streams to load data in micro batch fashion with CDC streams that capture most recent data only.
Answers
D.
Snowpipe can be leveraged with streams to load data in micro batch fashion with CDC streams that capture most recent data only.
E.
External tables with Materialized views can be created in Snowflake.
Answers
E.
External tables with Materialized views can be created in Snowflake.
Suggested answer: E

Explanation:

In a typical table, the data is stored in the database; however, in an external table, the data is stored in files in an external stage. External tables store file-level metadata about the data files, such as the filename, a version identifier and related properties. This enables querying data stored in files in an external stage as if it were inside a database. External tables can access data stored in any format supported by COPY INTO <table> statements.

External tables are read-only, therefore no DML operations can be performed on them; however, external tables can be used for query and join operations. Views can be created against external tables.

Querying data stored external to the database is likely to be slower than querying native database tables; however, materialized views based on external tables can improve query performance.

Creating External tables enable user for querying existing data stored in external cloud storage for analysis without first loading it into Snowflake. The source of truth for the data remains in the external cloud storage. Data sets materialized in Snowflake via materialized views are read-only.

This solution is especially beneficial to accounts that have a large amount of data stored in external cloud storage and only want to query a portion of the data; for example, the most recent data. Users can create materialized views on subsets of this data for improved query performance.

asked 23/09/2024
Olive Wang
39 questions
User
Your answer:
0 comments
Sorted by

Leave a comment first