ExamGecko
Home Home / Microsoft / DP-203

Microsoft DP-203 Practice Test - Questions Answers, Page 5

Question list
Search
Search

List of questions

Search

Related questions











You have a SQL pool in Azure Synapse.

A user reports that queries against the pool take longer than expected to complete. You determine that the issue relates to queried columnstore segments. You need to add monitoring to the underlying storage to help diagnose the issue. Which two metrics should you monitor? Each correct answer presents part of the solution. NOTE: Each correct selection is worth one point.

A.
Snapshot Storage Size
A.
Snapshot Storage Size
Answers
B.
Cache used percentage
B.
Cache used percentage
Answers
C.
DWU Limit
C.
DWU Limit
Answers
D.
Cache hit percentage
D.
Cache hit percentage
Answers
Suggested answer: B, D

Explanation:

D: Cache hit percentage: (cache hits / cache miss) * 100 where cache hits is the sum of all columnstore segments hits in the local SSD cache and cache miss is the columnstore segments misses in the local SSD cache summed across all nodes

B: (cache used / cache capacity) * 100 where cache used is the sum of all bytes in the local SSD cache across all nodes and cache capacity is the sum of the storage capacity of the local SSD cache across all nodes Incorrect Asnwers:

C: DWU limit: Service level objective of the data warehouse.

Reference: https://docs.microsoft.com/en-us/azure/synapse-analytics/sql-data-warehouse/sql-data-warehouse-concept-resource-utilization-query-activity

You manage an enterprise data warehouse in Azure Synapse Analytics. Users report slow performance when they run commonly used queries. Users do not report performance changes for infrequently used queries. You need to monitor resource utilization to determine the source of the performance issues. Which metric should you monitor?

A.
DWU percentage
A.
DWU percentage
Answers
B.
Cache hit percentage
B.
Cache hit percentage
Answers
C.
DWU limit
C.
DWU limit
Answers
D.
Data IO percentage
D.
Data IO percentage
Answers
Suggested answer: B

Explanation:

Monitor and troubleshoot slow query performance by determining whether your workload is optimally leveraging the adaptive cache for dedicated SQL pools.

Reference: https://docs.microsoft.com/en-us/azure/synapse-analytics/sql-data-warehouse/sql-data-warehouse-how-to-monitor-cache

You have an Azure Databricks resource.

You need to log actions that relate to changes in compute for the Databricks resource. Which Databricks services should you log?

A.
clusters
A.
clusters
Answers
B.
workspace
B.
workspace
Answers
C.
DBFS
C.
DBFS
Answers
D.
SSH
D.
SSH
Answers
E.
jobs
E.
jobs
Answers
Suggested answer: B

Explanation:

Databricks provides access to audit logs of activities performed by Databricks users, allowing your enterprise to monitor detailed Databricks usage patterns. There are two types of logs:

Workspace-level audit logs with workspace-level events. Account-level audit logs with account-level events.

Reference: https://docs.databricks.com/administration-guide/account-settings/audit-logs.html

You are designing a highly available Azure Data Lake Storage solution that will include geo-zone-redundant storage (GZRS). You need to monitor for replication delays that can affect the recovery point objective (RPO). What should you include in the monitoring solution?

A.
5xx: Server Error errors
A.
5xx: Server Error errors
Answers
B.
Average Success E2E Latency
B.
Average Success E2E Latency
Answers
C.
availability
C.
availability
Answers
D.
Last Sync Time
D.
Last Sync Time
Answers
Suggested answer: D

Explanation:

Because geo-replication is asynchronous, it is possible that data written to the primary region has not yet been written to the secondary region at the time an outage occurs. The Last Sync Time property indicates the last time that data from the primary region was written successfully to the secondary region. All writes made to the primary region before the last sync time are available to be read from the secondary location. Writes made to the primary region after the last sync time property may or may not be available for reads yet.

Reference:

https://docs.microsoft.com/en-us/azure/storage/common/last-sync-time-get

You configure monitoring from an Azure Synapse Analytics implementation. The implementation uses PolyBase to load data from comma-separated value (CSV) files stored in Azure Data Lake Storage Gen2 using an external table. Files with an invalid schema cause errors to occur.

You need to monitor for an invalid schema error.

For which error should you monitor?

A.
EXTERNAL TABLE access failed due to internal error: 'Java exception raised on call to HdfsBridge_Connect: Error [com.microsoft.polybase.client.KerberosSecureLogin] occurred while accessing external file.'
A.
EXTERNAL TABLE access failed due to internal error: 'Java exception raised on call to HdfsBridge_Connect: Error [com.microsoft.polybase.client.KerberosSecureLogin] occurred while accessing external file.'
Answers
B.
Cannot execute the query "Remote Query" against OLE DB provider "SQLNCLI11" for linked server "(null)". Query aborted- the maximum reject threshold (0 rows) was reached while reading from an external source: 1 rows rejected outof total 1 rows processed.
B.
Cannot execute the query "Remote Query" against OLE DB provider "SQLNCLI11" for linked server "(null)". Query aborted- the maximum reject threshold (0 rows) was reached while reading from an external source: 1 rows rejected outof total 1 rows processed.
Answers
C.
EXTERNAL TABLE access failed due to internal error: 'Java exception raised on call to HdfsBridge_Connect: Error [Unable to instantiate LoginClass] occurred while accessing external file.'
C.
EXTERNAL TABLE access failed due to internal error: 'Java exception raised on call to HdfsBridge_Connect: Error [Unable to instantiate LoginClass] occurred while accessing external file.'
Answers
D.
EXTERNAL TABLE access failed due to internal error: 'Java exception raised on call to HdfsBridge_Connect: Error [No FileSystem for scheme: wasbs] occurred while accessing external file.'
D.
EXTERNAL TABLE access failed due to internal error: 'Java exception raised on call to HdfsBridge_Connect: Error [No FileSystem for scheme: wasbs] occurred while accessing external file.'
Answers
Suggested answer: B

Explanation:

Error message: Cannot execute the query "Remote Query"

Possible Reason:

The reason this error happens is because each file has different schema. The PolyBase external table DDL when pointed to a directory recursively reads all the files in that directory. When a column or data type mismatch happens, this error could be seen in SSMS.

Reference:

https://docs.microsoft.com/en-us/sql/relational-databases/polybase/polybase-errors-and-possible-solutions

You have an Azure Synapse Analytics dedicated SQL pool.

You run PDW_SHOWSPACEUSED('dbo.FactInternetSales'); and get the results shown in the following table.

Which statement accurately describes the dbo.FactInternetSales table?

A.
All distributions contain data.
A.
All distributions contain data.
Answers
B.
The table contains less than 10,000 rows.
B.
The table contains less than 10,000 rows.
Answers
C.
The table uses round-robin distribution.
C.
The table uses round-robin distribution.
Answers
D.
The table is skewed.
D.
The table is skewed.
Answers
Suggested answer: D

You have two fact tables named Flight and Weather. Queries targeting the tables will be based on the join between the following columns.

You need to recommend a solution that maximizes query performance.

What should you include in the recommendation?

A.
In the tables use a hash distribution of ArrivalDateTime and ReportDateTime.
A.
In the tables use a hash distribution of ArrivalDateTime and ReportDateTime.
Answers
B.
In the tables use a hash distribution of ArrivalAirportID and AirportID.
B.
In the tables use a hash distribution of ArrivalAirportID and AirportID.
Answers
C.
In each table, create an IDENTITY column.
C.
In each table, create an IDENTITY column.
Answers
D.
In each table, create a column as a composite of the other two columns in the table.
D.
In each table, create a column as a composite of the other two columns in the table.
Answers
Suggested answer: B

Explanation:

Hash-distribution improves query performance on large fact tables. Incorrect Answers:

A: Do not use a date column for hash distribution. All data for the same date lands in the same distribution. If several users are all filtering on the same date, then only 1 of the 60 distributions do all the processing work.

You have several Azure Data Factory pipelines that contain a mix of the following types of activities:

Wrangling data flow

Notebook

Copy Jar

Which two Azure services should you use to debug the activities? Each correct answer presents part of the solution. NOTE: Each correct selection is worth one point

A.
Azure Synapse Analytics
A.
Azure Synapse Analytics
Answers
B.
Azure HDInsight
B.
Azure HDInsight
Answers
C.
Azure Machine Learning
C.
Azure Machine Learning
Answers
D.
Azure Data Factory
D.
Azure Data Factory
Answers
E.
Azure Databricks
E.
Azure Databricks
Answers
Suggested answer: B, D

You have an Azure Synapse Analytics dedicated SQL pool named Pool1 and a database named DB1. DB1 contains a fact table named Table1. You need to identify the extent of the data skew in Table1.

What should you do in Synapse Studio?

A.
Connect to the built-in pool and run sys.dm_pdw_nodes_db_partition_stats.
A.
Connect to the built-in pool and run sys.dm_pdw_nodes_db_partition_stats.
Answers
B.
Connect to Pool1 and run DBCC CHECKALLOC.
B.
Connect to Pool1 and run DBCC CHECKALLOC.
Answers
C.
Connect to the built-in pool and run DBCC CHECKALLOC.
C.
Connect to the built-in pool and run DBCC CHECKALLOC.
Answers
D.
Connect to Pool1 and query sys.dm_pdw_nodes_db_partition_stats.
D.
Connect to Pool1 and query sys.dm_pdw_nodes_db_partition_stats.
Answers
Suggested answer: D

Explanation:

Microsoft recommends use of sys.dm_pdw_nodes_db_partition_stats to analyze any skewness in the data. Reference:

https://docs.microsoft.com/en-us/sql/relational-databases/system-dynamic-management-views/sys-dm-db-partition-stats-transact-sql https://docs.microsoft.com/en-us/azure/synapse-analytics/sql-data-warehouse/cheat-sheet

A company purchases IoT devices to monitor manufacturing machinery. The company uses an Azure IoT Hub to communicate with the IoT devices. The company must be able to monitor the devices in real-time. You need to design the solution.

What should you recommend?

A.
Azure Data Factory instance using Azure Portal
A.
Azure Data Factory instance using Azure Portal
Answers
B.
Azure Data Factory instance using Azure PowerShell
B.
Azure Data Factory instance using Azure PowerShell
Answers
C.
Azure Stream Analytics cloud job using Azure Portal
C.
Azure Stream Analytics cloud job using Azure Portal
Answers
D.
Azure Data Factory instance using Microsoft Visual Studio
D.
Azure Data Factory instance using Microsoft Visual Studio
Answers
Suggested answer: A

Explanation:


Total 320 questions
Go to page: of 32