Effect of skewness in data

Question

I am preparing classification model. Many of numeric variables are positives skewed. Should I change a distribution of variables to be more Gaussian?

Leevo · Answer

What model are you using?

Ideally, one should always scale / normalize data before feeding them into some ML / statistical model. By using Z-scores you should be able to control for skewness of your variables.

fuwiak · Answer

Data does not necessarily have to be standardized and
mainly from the model, which we want to use.

Normality it's in many cases an asumption.
In this situation normality means that the error between the predictions and the actual answers is distributed normally.

Effect of skewness in data

2 Answers

Add your own answers!

Ask a Question