TransWikia.com

BIO tagging software

Data Science Asked by David Waterworth on March 1, 2021

I would like to label character data with BIO tags as part of an active learning process on unlabelled data. I am assuming there are open source GUI tools available which I can use to make this easier – i.e. present the string to be labeled and some way of tagging characters from a predefined set of tags (and probably allow new tags to be added).

I have not been able to find anything though – ideally cross-platform (Linux and macOS at a minimum). It could be a cloud service or AWS product but I’d prefer something which is locally installed.

Does anyone know of such a thing? Even something as simple as search term would help.

2 Answers

Prodigy is a tool for data annotation, I think it shines in NLP tasks.

I've also heard about Amazon mechanical turk, I know it can be used for annotation, but it's more general purpose.

Correct answer by David Masip on March 1, 2021

Another solution, without however the active learning support that prodigy offers, is doccano. It's open-source (free), easy to install and supports various applications, apart from NER (BIO tagging), also classification and translation.

Answered by Grzegorz on March 1, 2021

Add your own answers!

Ask a Question

Get help from others!

© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP