TransWikia.com

Building a text extractor to extract particular type of text from unstructured text data

Data Science Asked by Vivek Khetan on March 29, 2021

I have a lot of data and manually extracted annotations for the text. I was looking for any advice to automate the annotation extraction up to a good level of accuracy. Any advice is welcome.

One Answer

There is a spectrum of methods. On one end is the tried n' true method of performing pattern matching with rules. Write a series of regular expressions (i.e., regex). On the other end is the newer methods of weak supervision, a system that learns over time to get better. Snorkel one of implementation of weak supervision. Here is a Snorkel demo for identifying mentions of a chemical causing a disease from academic literature.

Answered by Brian Spiering on March 29, 2021

Add your own answers!

Ask a Question

Get help from others!

© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP