
Twitter dataset to train word embeddings

Open Data Asked by Mugdha Pandya on September 29, 2021

I’m working on a project that involves manipulating word embeddings. To do this, I need to train the embeddings myself on Twitter data. Given Twitter’s policy, I have been unable to find a suitable dataset. Does anyone have one, or know where I can find one?

The dataset should:

  • contain public tweets
  • cover no specific topic (I just need a large number of tweets)
  • be pre-processed

One Answer

Tweet Sentiment Extraction

Kaggle supports a variety of dataset publication formats.

Kaggle Twitter Datasets

You need a Kaggle account to download the datasets. Preprocessing code for each dataset can be found in its kernels tab.
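If the dataset you pick is not already pre-processed, a minimal normalisation pass might look like the sketch below. It is not taken from any Kaggle kernel. The function name `preprocess_tweet` and the placeholder tokens `<url>` and `<user>` are assumptions chosen for illustration, and the cleaning rules (lowercasing, stripping the `#` from hashtags, dropping punctuation) are just one common choice.

```python
import re

# Patterns are illustrative assumptions, not an official Twitter tokenizer.
URL_RE = re.compile(r"https?://\S+|www\.\S+")
MENTION_RE = re.compile(r"@\w+")
HASHTAG_RE = re.compile(r"#(\w+)")
# A token is either a placeholder such as <url> or a word (optionally with an apostrophe).
TOKEN_RE = re.compile(r"<\w+>|\w+(?:'\w+)?")


def preprocess_tweet(text):
    """Lowercase a tweet, mask URLs and mentions, and return a list of tokens."""
    text = text.lower()
    text = URL_RE.sub(" <url> ", text)        # replace links with a placeholder
    text = MENTION_RE.sub(" <user> ", text)   # replace @handles with a placeholder
    text = HASHTAG_RE.sub(r" \1 ", text)      # keep the hashtag word, drop the '#'
    return TOKEN_RE.findall(text)             # punctuation and emoji are discarded


if __name__ == "__main__":
    print(preprocess_tweet("Loving #NLP with @bob! https://t.co/abc"))
```

The resulting token lists, one per tweet, are the usual input shape for embedding trainers such as gensim's `Word2Vec`.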

Answered by Pluviophile on September 29, 2021
