List of questions
Related questions
Question 208 - MLS-C01 discussion
A data engineer at a bank is evaluating a new tabular dataset that includes customer data. The data engineer will use the customer data to create a new model to predict customer behavior. After creating a correlation matrix for the variables, the data engineer notices that many of the 100 features are highly correlated with each other.
Which steps should the data engineer take to address this issue? (Choose two.)
A.
Use a linear-based algorithm to train the model.
B.
Apply principal component analysis (PCA).
C.
Remove a portion of highly correlated features from the dataset.
D.
Apply min-max feature scaling to the dataset.
E.
Apply one-hot encoding category-based variables.
Your answer:
0 comments
Sorted by
Leave a comment first