TransWikia.com

Financial Corpus

Open Data Asked by Tomi Sargiotto on November 20, 2021

I’ll be delving into text mining applications for my master thesis and I need data for it. Ideally, I would need a corpora of texts/news articles from some single (or multiple) credible and authoritative source covering financial markets/economy and the like, spanning a time period as long as possible.

Does anyone know of datasets/corporas that would suit my requirements?
Any other possibilities/ideas to construct one such dataset using open resources?

I know this question has been already asked but the answers that were posted are not longer valid.

One Answer

Reuters Financial Dataset as a structured DataFrame

Reuters Financial Dataset is a large collection of Financial News Article scraped from Reuters website. Originally used for the paper Using Structured Events to Predict Stock Price Movement:An Empirical Investigation - Ding et al.(2014) this set of unstructured data is a powerful warehouse of historic Financial Data. This script provides a way of arranging the huge corpus of information into a Pandas' efficient data structure DataFrame

Originally, this repository consisted of badly written Python script which was monolitic and cryptic. This refactor breaks the code down into smaller functions and comes equipped with a function to create the DataFrame.

Answered by Pluviophile on November 20, 2021

Add your own answers!

Ask a Question

Get help from others!

© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP