Home

Home / Snowflake / DEA-C01

Question list

List of questions

Question 1

(0)

Streams cannot be created to query change data on which of the following objects? [Select All that Apply]

Question 2

(0)

Tasks may optionally use table streams to provide a convenient way to continuously process new or changed dat a. A task can transform new or changed rows that a stream surfaces. Each time a task is

Question 3

(0)

Question 4

(0)

Mark a Data Engineer, looking to implement streams on local views & want to use change tracking metadata for one of its Data Loading use case. Please select the incorrect understanding points of

Question 5

(0)

To advance the offset of a stream to the current table version without consuming the change data in a DML operation, which of the following operations can be done by Data Engineer? [Select 2]

Question 6

(0)

Data Engineer is performing below steps in sequence while working on Stream s1 created on table t1. Step 1: Begin transaction. Step 2: Query stream s1 on table t1. Step 3: Update rows in table t1

Question 7

(0)

Streams record the differences between two offsets. If a row is added and then updated in the current offset, what will be the value of METADATA^^SUPDATE Columns in this scenario?

Question 8

(0)

Mark the Incorrect Statements with respect to types of streams supported by Snowflake?

Question 9

(0)

Stuart, a Lead Data Engineer in MACRO Data Company created streams on set of External tables. He has been asked to extend the data retention period of the stream for 90 days, which parameter he can

Question 10

(0)

Ron, Snowflake Developer needs to capture change data (insert only) on the source views, for that he follows the below steps: Enable change tracking on the source views & its underlying tables.

Open VPLUS File

Convert VPLUS to PDF

Related questions

Select the correct usage statements with regards to SQL UDF?

For the most efficient and cost-effective Data load experience, Data Engineer needs to inconsider-ate which of the following considerations?

Which is the non-supportable JavaScript UDF data types?

Which are supported Programming Languages for Creating UDTFs?

Snowflake web interface can be used to create users with no passwords or remove passwords from existing users?

To troubleshoot data load failure in one of your Copy Statement, Data Engineer have Executed a COPY statement with the VALIDATION_MODE copy option set to RETURN_ALL_ERRORS with reference to the set of files he had attempted to load. Which below function can facilitate analysis of the problematic records on top of the Results produced? [Select 2]

Regular views do not cache data, and therefore cannot improve performance by caching?

For enabling non-ACCOUNTADMIN Roles to Perform Data Sharing Tasks, which two glob-al/account privileges snowflake provide?

For SQL UDFs, The invoker of the function need not have access to the objects referenced in the function definition, but only needs the privilege to use the function?

When using the CURRENT_ROLE and CURRENT_USER functions with secure UDFs that will be shared with Snowflake accounts, Snowflake returns a NULL value for these functions?

Question 92 - DEA-C01 discussion

Report

What are Common Query Problems a Data Engineer can identified using Query Profiler?

A.

"Exploding" Joins i.e Joins resulting due to a "Cartesian product"

B.

Queries Too Large to Fit in Memory

C.

Inefficient Pruning

D.

Ineffective Data Sharing

Suggested answer: A, B, C

Explanation:

"Exploding" Joins

One of the common mistakes SQL users make is joining tables without providing a join condition (resulting in a "Cartesian product"), or providing a condition where records from one table match multiple records from another table. For such queries, the Join operator produces significantly (often by orders of magnitude) more tuples than it consumes.

This can be observed by looking at the number of records produced by a Join operator in the profile interface, and typically is also reflected in Join operator consuming a lot of time.

Queries Too Large to Fit in Memory

For some operations (e.g. duplicate elimination for a huge data set), the amount of memory available for the compute resources used to execute the operation might not be sufficient to hold intermediate results. As a result, the query processing engine will start spilling the data to local disk.

If the local disk space is not sufficient, the spilled data is then saved to remote disks.

This spilling can have a profound effect on query performance (especially if remote disk is used for spilling).

Spilling statistics can be checked in Query Profile Interface.

Inefficient Pruning

Snowflake collects rich statistics on data allowing it not to read unnecessary parts of a table based on the query filters. However, for this to have an effect, the data storage order needs to be correlat-ed with the query filter attributes.

The efficiency of pruning can be observed by comparing Partitions scanned and Partitions total statistics in the TableScan operators. If the former is a small fraction of the latter, pruning is efficient. If not, the pruning did not have an effect.

Of course, pruning can only help for queries that actually filter out a significant amount of data. If the pruning statistics do not show data reduction, but there is a Filter operator above TableScan which filters out a number of records, this might signal that a different data organization might be beneficial for this query.

Show Answer

asked 23/09/2024

bijay ghimire

37 questions

User

Your answer: A B C D

0 comments

Sorted by

Leave a comment first