Cross Validated Asked by Eugen Cuic on November 14, 2021
Hope I got it right, as this is my first active post 🙂
I was trying to find a solution the whole day for my problem.
I am trying to predict a continuos variable based on 20 different predictors. The predicted variable is an average m’3 per transportation unit across all palets. the 20 different predictors are the ordered volume per department.
The data looks like this:
You can see the predictors in the columns VS01 – VS20. Lots of them are zero, because not every order buys something from a department.
In order to create a linear regression I was trying to normalize data with different approaches.
What I’ve found is that the solutions for zero inflated issues are focusing only on the predicted value, not on the predictors.
So I am trying to ask the community, if you have another approach to try out?
Bellow you can find the 2nd picutre about 2 of the department’s volume from the orders. Almost all of them look like this:
Additional what would be nice to understand a bit more is how to transform such a non linear distribution into something normal distributed
What I would also be open, if someone could suggest a different approach than linear regression. Maybe:
Thanks a lot
Get help from others!
Recent Answers
Recent Questions
© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP