Cross Validated Asked by Vineet on November 12, 2021
I need some general advice and possible ideas.
Problem statement goes like this —
We are given a tweet and we have to specify associated labels for it like generalized hate, support, oppose, refutation, allegation, sarcasm.
The training data is ~6k tweets. However, there is very high class imbalance. Almost 90% of classes have 0s and rest are one.
The approach I have tried:
What I figured out that the BERT vocab is not recognizing most of the words and mapping it to zero.
What other approaches should I try? Any leads are appreciated.
Get help from others!
Recent Answers
Recent Questions
© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP