ExamGecko
Question list
Search
Search

List of questions

Search

Related questions











Question 241 - MLS-C01 discussion

Report
Export

A data scientist is building a linear regression model. The scientist inspects the dataset and notices that the mode of the distribution is lower than the median, and the median is lower than the mean.

Which data transformation will give the data scientist the ability to apply a linear regression model?

A.
Exponential transformation
Answers
A.
Exponential transformation
B.
Logarithmic transformation
Answers
B.
Logarithmic transformation
C.
Polynomial transformation
Answers
C.
Polynomial transformation
D.
Sinusoidal transformation
Answers
D.
Sinusoidal transformation
Suggested answer: B

Explanation:

A logarithmic transformation is a suitable data transformation for a linear regression model when the data has a skewed distribution, such as when the mode is lower than the median and the median is lower than the mean. A logarithmic transformation can reduce the skewness and make the data more symmetric and normally distributed, which are desirable properties for linear regression. A logarithmic transformation can also reduce the effect of outliers and heteroscedasticity (unequal variance) in the data. An exponential transformation would have the opposite effect of increasing the skewness and making the data more asymmetric. A polynomial transformation may not be able to capture the nonlinearity in the data and may introduce multicollinearity among the transformed variables. A sinusoidal transformation is not appropriate for data that does not have a periodic pattern.

References:

Data Transformation - Scaler Topics

Linear Regression - GeeksforGeeks

Linear Regression - Scribbr

asked 16/09/2024
Jasper Fons
38 questions
User
Your answer:
0 comments
Sorted by

Leave a comment first