Data Science Asked by Naveen Kumar on May 4, 2021
I have very high dimensional data. Almost 20% of the columns has different value in less than 1% of rows. All of these are binary columns and many columns has 0s filled in more than almost 98% of rows.
Some more info:
Target variable is an imbalanced(91.9%:8.1%) binary variable.
Every variable I have, except 3, are binary.
I would like some ideas on how to deal with columns like this? drop them or smote to have more data?
Thanks in advance.
Get help from others!
Recent Questions
Recent Answers
© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP