Microsoft DP-100 Practice Test - Questions Answers, Page 6

DRAG DROP

You have a model with a large difference between the training and validation error values.

You must create a new model and perform cross-validation.

You need to identify a parameter set for the new model using Azure Machine Learning Studio.

Which module should you use for each step? To answer, drag the appropriate modules to the correct steps. Each module may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.

NOTE: Each correct selection is worth one point.


Question 51

Explanation:

Box 1: Split data

Box 2: Partition and Sample

Box 3: Two-Class Boosted Decision Tree

Box 4: Tune Model Hyperparameters

Integrated train and tune: you configure a set of parameters to use, and then let the module iterate over multiple combinations, measuring accuracy until it finds a "best" model. With most learner modules, you can choose which parameters should be changed during the training process and which should remain fixed.

We recommend that you use Cross-Validate Model to establish the goodness of the model given the specified parameters. Use Tune Model Hyperparameters to identify the optimal parameters.
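The tune-then-validate loop that Tune Model Hyperparameters automates can be sketched in plain Python. This is a minimal illustration with a hypothetical one-parameter threshold classifier and a hand-rolled k-fold split, not the Studio module itself:

```python
from itertools import product

def k_fold_indices(n, k):
    """Yield (train, validation) index lists for k-fold cross-validation."""
    fold = n // k
    for i in range(k):
        val = list(range(i * fold, (i + 1) * fold))
        train = [j for j in range(n) if j not in val]
        yield train, val

def accuracy(threshold, xs, ys):
    """Toy one-feature classifier: predict 1 when x >= threshold."""
    preds = [1 if x >= threshold else 0 for x in xs]
    return sum(p == y for p, y in zip(preds, ys)) / len(ys)

def tune(xs, ys, grid, k=4):
    """Iterate over parameter combinations, scoring each by mean CV accuracy."""
    best_params, best_score = None, -1.0
    for (threshold,) in product(grid["threshold"]):
        scores = []
        for train, val in k_fold_indices(len(xs), k):
            # A real tuner would fit the model on `train` here; the toy
            # model has no fitting step, so we only score on `val`.
            scores.append(accuracy(threshold,
                                   [xs[i] for i in val],
                                   [ys[i] for i in val]))
        mean = sum(scores) / len(scores)
        if mean > best_score:
            best_params, best_score = {"threshold": threshold}, mean
    return best_params, best_score

xs = [0.1, 0.2, 0.3, 0.4, 0.6, 0.7, 0.8, 0.9]
ys = [0, 0, 0, 0, 1, 1, 1, 1]
params, score = tune(xs, ys, {"threshold": [0.25, 0.5, 0.75]})
```

On this toy dataset, a threshold of 0.5 separates the classes perfectly, so the loop selects it; the Studio module does the same grid-and-score iteration over hyperparameters of a real learner.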

References:

https://docs.microsoft.com/en-us/azure/machine-learning/studio-module-reference/partition-and-sample

HOTSPOT

You are analyzing the asymmetry in a statistical distribution.

The following image contains two density curves that show the probability distribution of two datasets.

Use the drop-down menus to select the answer choice that answers each question based on the information presented in the graphic.

NOTE: Each correct selection is worth one point.


Question 52

Explanation:

Box 1: Positive skew

A positive skewness value means the distribution is skewed to the right.

Box 2: Negative skew

Negative skewness values mean the distribution is skewed to the left.
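Skewness is the third standardized moment, so its sign can be checked with a few lines of plain Python. This is a minimal sketch using the population-moment formula on hypothetical samples, not Studio's Compute Elementary Statistics module:

```python
def skewness(data):
    """Population skewness: third central moment over the cubed standard deviation."""
    n = len(data)
    mean = sum(data) / n
    m2 = sum((x - mean) ** 2 for x in data) / n   # variance
    m3 = sum((x - mean) ** 3 for x in data) / n   # third central moment
    return m3 / m2 ** 1.5

right_tailed = [1, 1, 1, 2, 2, 3, 10]   # long tail to the right -> positive skew
left_tailed = [1, 8, 9, 9, 10, 10, 10]  # long tail to the left  -> negative skew
```

The sample with the long right tail yields a positive skewness value and the mirrored sample a negative one, matching the two density curves in the question.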

References:

https://docs.microsoft.com/en-us/azure/machine-learning/studio-module-reference/compute-elementary-statistics

HOTSPOT

You train a classification model by using a decision tree algorithm.

You create an estimator by running the following Python code. The variable feature_names is a list of all feature names, and class_names is a list of all class names.

from interpret.ext.blackbox import TabularExplainer

You need to explain the predictions made by the model for all classes by determining the importance of all features.

For each of the following statements, select Yes if the statement is true. Otherwise, select No.

NOTE: Each correct selection is worth one point.


Question 53

Explanation:

Box 1: Yes

TabularExplainer calls one of the three SHAP explainers underneath (TreeExplainer, DeepExplainer, or KernelExplainer).

Box 2: Yes

To make your explanations and visualizations more informative, you can choose to pass in feature names and output class names if doing classification.

Box 3: No

TabularExplainer automatically selects the most appropriate explainer for your use case, but you can also call each of the three underlying explainers (TreeExplainer, DeepExplainer, or KernelExplainer) directly.

Reference:

https://docs.microsoft.com/en-us/azure/machine-learning/how-to-machine-learning-interpretability-aml

DRAG DROP

You have several machine learning models registered in an Azure Machine Learning workspace.

You must use the Fairlearn dashboard to assess fairness in a selected model.

Which three actions should you perform in sequence? To answer, move the appropriate actions from the list of actions to the answer area and arrange them in the correct order.


Question 54

Explanation:

Step 1: Select a model feature to be evaluated.

Step 2: Select a binary classification or regression model.

Register your models within Azure Machine Learning. For convenience, store the results in a dictionary, which maps the id of the registered model (a string in name:version format) to the predictor itself.

Example:

model_dict = {}
lr_reg_id = register_model("fairness_logistic_regression", lr_predictor)
model_dict[lr_reg_id] = lr_predictor
svm_reg_id = register_model("fairness_svm", svm_predictor)
model_dict[svm_reg_id] = svm_predictor

Step 3: Select a metric to be measured

Precompute fairness metrics.

Create a dashboard dictionary using Fairlearn's metrics package.

Reference:

https://docs.microsoft.com/en-us/azure/machine-learning/how-to-machine-learning-fairness-aml

HOTSPOT

A biomedical research company plans to enroll people in an experimental medical treatment trial.

You create and train a binary classification model to support selection and admission of patients to the trial. The model includes the following features: Age, Gender, and Ethnicity.

The model returns different performance metrics for people from different ethnic groups.

You need to use Fairlearn to mitigate and minimize disparities for each category in the Ethnicity feature.

Which technique and constraint should you use? To answer, select the appropriate options in the answer area.

NOTE: Each correct selection is worth one point.


Question 55

Explanation:

Box 1: Grid Search

The Fairlearn open-source package provides the unfairness-mitigation algorithms ExponentiatedGradient, GridSearch, and ThresholdOptimizer.

Note: The Fairlearn open-source package provides two types of unfairness-mitigation algorithms:

Reduction: These algorithms take a standard black-box machine learning estimator (e.g., a LightGBM model) and generate a set of retrained models using a sequence of re-weighted training datasets.

Post-processing: These algorithms take an existing classifier and the sensitive feature as input.

Box 2: Demographic parity

The Fairlearn open-source package supports the following types of parity constraints: Demographic parity, Equalized odds, Equal opportunity, and Bounded group loss.
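Demographic parity requires the selection rate (the fraction of positive predictions) to be equal across the sensitive-feature groups. The disparity can be computed in plain Python; this is a hypothetical illustration with made-up predictions and group labels, not Fairlearn's own metrics API:

```python
def selection_rates(predictions, groups):
    """Fraction of positive (1) predictions within each sensitive-feature group."""
    totals, positives = {}, {}
    for pred, group in zip(predictions, groups):
        totals[group] = totals.get(group, 0) + 1
        positives[group] = positives.get(group, 0) + pred
    return {g: positives[g] / totals[g] for g in totals}

def demographic_parity_difference(predictions, groups):
    """Gap between the highest and lowest group selection rates (0 = parity)."""
    rates = selection_rates(predictions, groups)
    return max(rates.values()) - min(rates.values())

preds = [1, 0, 1, 1, 0, 0, 1, 0]
ethnicity = ["A", "A", "A", "A", "B", "B", "B", "B"]
```

Here group A is selected at rate 0.75 and group B at 0.25; a mitigation algorithm such as GridSearch searches for a model that shrinks this gap toward zero.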

Reference:

https://docs.microsoft.com/en-us/azure/machine-learning/concept-fairness-ml

You are developing a hands-on workshop to introduce Docker for Windows to attendees.

You need to ensure that workshop attendees can install Docker on their devices.

Which two prerequisite components should attendees install on the devices? Each correct answer presents part of the solution.

NOTE: Each correct selection is worth one point.

A. Microsoft Hardware-Assisted Virtualization Detection Tool
B. Kitematic
C. BIOS-enabled virtualization
D. VirtualBox
E. Windows 10 64-bit Professional
Suggested answer: C, E

Explanation:

C: Make sure your Windows system supports Hardware Virtualization Technology and that hardware virtualization support is turned on in the BIOS settings.

E: To run Docker, your machine must have a 64-bit operating system running Windows 7 or higher.

Reference:

https://docs.docker.com/toolbox/toolbox_install_windows/

https://blogs.technet.microsoft.com/canitpro/2015/09/08/step-by-step-enabling-hyper-v-for-use-on-windows-10/

Your team is building a data engineering and data science development environment.

The environment must support the following requirements:

support Python and Scala

compose data storage, movement, and processing services into automated data pipelines

the same tool should be used for the orchestration of both data engineering and data science

support workload isolation and interactive workloads

enable scaling across a cluster of machines

You need to create the environment.

What should you do?

A. Build the environment in Apache Hive for HDInsight and use Azure Data Factory for orchestration.
B. Build the environment in Azure Databricks and use Azure Data Factory for orchestration.
C. Build the environment in Apache Spark for HDInsight and use Azure Container Instances for orchestration.
D. Build the environment in Azure Databricks and use Azure Container Instances for orchestration.
Suggested answer: B

Explanation:

In Azure Databricks, you can create two different types of clusters: Standard and High Concurrency. Standard clusters are the default and can be used with Python, R, Scala, and SQL; High Concurrency clusters provide workload isolation for interactive workloads.

Azure Databricks is fully integrated with Azure Data Factory.

Incorrect Answers:

D: Azure Container Instances is good for development or testing, but it is not suitable for production workloads.

Reference: https://docs.microsoft.com/en-us/azure/architecture/data-guide/technology-choices/data-science-and-machine-learning

You plan to build a team data science environment. Data for training models in machine learning pipelines will be over 20 GB in size.

You have the following requirements:

Models must be built using Caffe2 or Chainer frameworks.

Data scientists must be able to use a data science environment to build the machine learning pipelines and train models on their personal devices in both connected and disconnected network environments.

Personal devices must support updating machine learning pipelines when connected to a network.

You need to select a data science environment.

Which environment should you use?

A. Azure Machine Learning Service
B. Azure Machine Learning Studio
C. Azure Databricks
D. Azure Kubernetes Service (AKS)
Suggested answer: A

Explanation:

The Data Science Virtual Machine (DSVM) is a customized VM image on Microsoft's Azure cloud built specifically for doing data science. Caffe2 and Chainer are supported by DSVM. DSVM integrates with Azure Machine Learning.

Incorrect Answers:

B: Use Machine Learning Studio when you want to experiment with machine learning models quickly and easily, and the built-in machine learning algorithms are sufficient for your solutions.

Reference: https://docs.microsoft.com/en-us/azure/machine-learning/data-science-virtual-machine/overview

You are implementing a machine learning model to predict stock prices.

The model uses a PostgreSQL database and requires GPU processing.

You need to create a virtual machine that is pre-configured with the required tools.

What should you do?

A. Create a Data Science Virtual Machine (DSVM) Windows edition.
B. Create a Geo AI Data Science Virtual Machine (Geo-DSVM) Windows edition.
C. Create a Deep Learning Virtual Machine (DLVM) Linux edition.
D. Create a Deep Learning Virtual Machine (DLVM) Windows edition.
Suggested answer: A

Explanation:

In the DSVM, your training models can use deep learning algorithms on hardware that's based on graphics processing units (GPUs).

PostgreSQL is available for the following operating systems: Linux (all recent distributions), macOS (64-bit installers for OS X 10.6 and newer), and Windows (64-bit installers, tested on the latest versions and back to Windows Server 2012 R2).

Incorrect Answers:

B: The Azure Geo AI Data Science VM (Geo-DSVM) delivers geospatial analytics capabilities from Microsoft's Data Science VM. Specifically, this VM extends the AI and data science toolkits in the Data Science VM by adding ESRI's market-leading ArcGIS Pro Geographic Information System.

C, D: The DLVM is a template on top of the DSVM image. The packages, GPU drivers, and so on are all present in the DSVM image; the DLVM mainly adds convenience during creation, because it can be created only on GPU VM instances in Azure.

Reference: https://docs.microsoft.com/en-us/azure/machine-learning/data-science-virtual-machine/overview

You are developing deep learning models to analyze semi-structured, unstructured, and structured data types.

You have the following data available for model building:

Video recordings of sporting events

Transcripts of radio commentary about events

Logs from related social media feeds captured during sporting events

You need to select an environment for creating the model.

Which environment should you use?

A. Azure Cognitive Services
B. Azure Data Lake Analytics
C. Azure HDInsight with Spark MLlib
D. Azure Machine Learning Studio
Suggested answer: A

Explanation:

Azure Cognitive Services expand on Microsoft's evolving portfolio of machine learning APIs and enable developers to easily add cognitive features - such as emotion and video detection; facial, speech, and vision recognition; and speech and language understanding - into their applications. The goal of Azure Cognitive Services is to help developers create applications that can see, hear, speak, understand, and even begin to reason. The catalog of services within Azure Cognitive Services can be categorized into five main pillars: Vision, Speech, Language, Search, and Knowledge.

Reference: https://docs.microsoft.com/en-us/azure/cognitive-services/welcome
