Data Science Asked by Dima on February 3, 2021
I am preparing classification model. Many of numeric variables are positives skewed. Should I change a distribution of variables to be more Gaussian?
What model are you using?
Ideally, one should always scale / normalize data before feeding them into some ML / statistical model. By using Z-scores you should be able to control for skewness of your variables.
Answered by Leevo on February 3, 2021
Data does not necessarily have to be standardized and mainly from the model, which we want to use.
Normality it's in many cases an asumption. In this situation normality means that the error between the predictions and the actual answers is distributed normally.
Answered by fuwiak on February 3, 2021
Get help from others!
Recent Questions
Recent Answers
© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP