Snowflake DSA-C02 Practice Test - Questions Answers, Page 7
List of questions
Related questions
Which metric is not used for evaluating classification models?
A.
Recall
B.
Accuracy
C.
Mean absolute error
D.
Precision
You previously trained a model using a training dataset. You want to detect any data drift in the new data collected since the model was trained.
What should you do?
A.
Create a new dataset using the new data and a timestamp column and create a data drift monitor that uses the training dataset as a baseline and the new dataset as a target.
B.
Create a new version of the dataset using only the new data and retrain the model.
C.
Add the new data to the existing dataset and enable Application Insights for the service where the model is deployed.
D.
Retrained your training dataset after correcting data outliers & no need to introduce new data.
You are training a binary classification model to support admission approval decisions for a college degree program.
How can you evaluate if the model is fair, and doesn't discriminate based on ethnicity?
A.
Evaluate each trained model with a validation dataset and use the model with the highest accuracy score.
B.
Remove the ethnicity feature from the training dataset.
C.
Compare disparity between selection rates and performance metrics across ethnicities.
D.
None of the above.
Which tools helps data scientist to manage ML lifecycle & Model versioning?
A.
MLFlow
B.
Pachyderm
C.
Albert
D.
CRUX
Mark the incorrect statement regarding usage of Snowflake Stream & Tasks?
A.
Snowflake automatically resizes and scales the compute resources for serverless tasks.
B.
Snowflake ensures only one instance of a task with a schedule (i.e. a standalone task or the root task in a DAG) is executed at a given time. If a task is still running when the next scheduled execution time occurs, then that scheduled time is skipped.
C.
Streams support repeatable read isolation.
D.
An standard-only stream tracks row inserts only.
Question