TransWikia.com

Effect of skewness in data

Data Science Asked by Dima on February 3, 2021

I am preparing classification model. Many of numeric variables are positives skewed. Should I change a distribution of variables to be more Gaussian?

2 Answers

What model are you using?

Ideally, one should always scale / normalize data before feeding them into some ML / statistical model. By using Z-scores you should be able to control for skewness of your variables.

Answered by Leevo on February 3, 2021

Data does not necessarily have to be standardized and mainly from the model, which we want to use.

Normality it's in many cases an asumption. In this situation normality means that the error between the predictions and the actual answers is distributed normally.

Answered by fuwiak on February 3, 2021

Add your own answers!

Ask a Question

Get help from others!

© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP