ExamGecko
Question list
Search
Search

List of questions

Search

Related questions











Question 249 - MLS-C01 discussion

Report
Export

An obtain relator collects the following data on customer orders: demographics, behaviors, location, shipment progress, and delivery time. A data scientist joins all the collected datasets. The result is a single dataset that includes 980 variables.

The data scientist must develop a machine learning (ML) model to identify groups of customers who are likely to respond to a marketing campaign.

Which combination of algorithms should the data scientist use to meet this requirement? (Select TWO.)

A.
Latent Dirichlet Allocation (LDA)
Answers
A.
Latent Dirichlet Allocation (LDA)
B.
K-means
Answers
B.
K-means
C.
Se mantic feg mentation
Answers
C.
Se mantic feg mentation
D.
Principal component analysis (PCA)
Answers
D.
Principal component analysis (PCA)
E.
Factorization machines (FM)
Answers
E.
Factorization machines (FM)
Suggested answer: B, D

Explanation:

The data scientist should useK-meansandprincipal component analysis (PCA)to meet this requirement. K-means is a clustering algorithm that can group customers based on their similarity in the feature space. PCA is a dimensionality reduction technique that can transform the original 980 variables into a smaller set of uncorrelated variables that capture most of the variance in the data. This can help reduce the computational cost and noise in the data, and improve the performance of the clustering algorithm.

References:

Clustering - Amazon SageMaker

Dimensionality Reduction - Amazon SageMaker

asked 16/09/2024
Marcelo Tamaki
36 questions
User
Your answer:
0 comments
Sorted by

Leave a comment first