ExamGecko
Question list
Search
Search

List of questions

Search

Related questions











Question 77 - DP-203 discussion

Report
Export

You are designing a fact table named FactPurchase in an Azure Synapse Analytics dedicated SQL pool. The table contains purchases from suppliers for a retail store. FactPurchase will contain the following columns.

FactPurchase will have 1 million rows of data added daily and will contain three years of data.

Transact-SQL queries similar to the following query will be executed daily.

SELECT

SupplierKey, StockItemKey, COUNT(*)

FROM FactPurchase

WHERE DateKey >= 20210101

AND DateKey <= 20210131

GROUP By SupplierKey, StockItemKey

Which table distribution will minimize query times?

A.
replicated
Answers
A.
replicated
B.
hash-distributed on PurchaseKey
Answers
B.
hash-distributed on PurchaseKey
C.
round-robin
Answers
C.
round-robin
D.
hash-distributed on DateKey
Answers
D.
hash-distributed on DateKey
Suggested answer: B

Explanation:

Hash-distributed tables improve query performance on large fact tables, and are the focus of this article. Round-robin tables are useful for improving loading speed. Incorrect:

Not D: Do not use a date column. . All data for the same date lands in the same distribution. If several users are all filtering on the same date, then only 1 of the 60 distributions do all the processing work.

Reference:

https://docs.microsoft.com/en-us/azure/synapse-analytics/sql-data-warehouse/sql-data-warehouse-tables-distribute

asked 02/10/2024
Nicola Grossi
38 questions
User
Your answer:
0 comments
Sorted by

Leave a comment first