Data Science Asked on January 14, 2021
I have a couple of hundred categories where each of these categories has a specific set of attributes having different values (historical).
The problem I need to solve is to select the best set of categories out of a smaller group which meet some constraints.
I'm new to data science and was wondering how to go about solving this problem.
One option I thought of was to use multiple regression on the different attributes to assign a weight to each category, and then use these weights to train and test a random forest on the historical groups of categories.
Does this make sense?
If I understand your question correctly, you are trying to solve a "multiclass classification" problem, so I'd suggest doing some research on that. Possible solutions range from Support Vector Classification (simple) to deep neural networks (harder). In most machine learning problems, it's best to start with a simple approach.
I recommend checking out scikit-learn's docs on classification models. Also, if you're a beginner, you might find this cheatsheet handy for similar questions in the future.
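To make the "start simple" advice concrete, here is a minimal sketch of a Support Vector Classification baseline in scikit-learn. The data is synthetic and purely a stand-in: the number of samples, attributes, and classes are assumptions made for illustration and have nothing to do with the real categories in the question.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in data (assumption): 600 rows, 10 numeric attributes,
# 4 classes. Replace with your own historical attribute table and labels.
X, y = make_classification(
    n_samples=600,
    n_features=10,
    n_informative=6,
    n_classes=4,
    n_clusters_per_class=1,
    random_state=0,
)

# Hold out a quarter of the rows so the score reflects unseen data.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# SVMs are sensitive to feature scale, so standardise before fitting.
model = make_pipeline(StandardScaler(), SVC(kernel="rbf", C=1.0))
model.fit(X_train, y_train)

# Mean accuracy on the held-out rows.
accuracy = model.score(X_test, y_test)
print(f"held-out accuracy: {accuracy:.3f}")
```

If a baseline like this already performs acceptably on your real data, a more complex model may not be worth the effort. If it does not, the held-out score gives you a number to beat.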
Answered by timchap on January 14, 2021