ExamGecko
Question list
Search
Search

List of questions

Search

Related questions











Question 105 - DP-203 discussion

Report
Export

You plan to perform batch processing in Azure Databricks once daily. Which type of Databricks cluster should you use?

A.
High Concurrency
Answers
A.
High Concurrency
B.
automated
Answers
B.
automated
C.
interactive
Answers
C.
interactive
Suggested answer: C

Explanation:

Azure Databricks has two types of clusters: interactive and automated. You use interactive clusters to analyze data collaboratively with interactive notebooks. You use automated clusters to run fast and robust automated jobs. Example: Scheduled batch workloads (data engineers running ETL jobs) This scenario involves running batch job JARs and notebooks on a regular cadence through the Databricks platform. The suggested best practice is to launch a new cluster for each run of critical jobs. This helps avoid any issues (failures, missing SLA, and so on) due to an existing workload (noisy neighbor) on a shared cluster.

Reference: https://docs.databricks.com/administration-guide/cloud-configurations/aws/cmbp.html#scenario-3-scheduled-batch-workloads-data-engineers-running-etl-jobs

asked 02/10/2024
Hamid Assimi
38 questions
User
Your answer:
0 comments
Sorted by

Leave a comment first