TransWikia.com

Extract segment from document scan

Data Science Asked by feeeper on September 25, 2020

I need to extract certain "valuable" pieces of information from a document scan: for example, the document number, incoming date, organizations, persons, etc.

Example document:

[Image: example document scan with the target segment highlighted]

I'm trying to extract the highlighted segment of the document. The original scan doesn't have that highlighting, and the value can be handwritten or typewritten.

I tried U-Net and Mask R-CNN on my dataset (~100 examples), without any success.

Any ideas?

One Answer

Hi, feeeper!

I wrote a simple program to extract data from documents. It works pretty well.

https://gist.github.com/fuwiak/780cb4abbe01aa5d1438269dfa0a3cfc
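The gist itself is not reproduced here. As a rough illustration only (not the gist's code), here is a minimal sketch of one step that any segmentation-based approach, such as the U-Net mentioned in the question, would need: turning a predicted binary mask into a rectangular crop of the scan. The names `mask_bounding_box` and `crop_segment` and the `margin` parameter are hypothetical, and images are assumed to be plain nested lists of pixel values so the sketch has no dependencies.

```python
from typing import List, Optional, Tuple

Grid = List[List[int]]  # assumed representation: rows of pixel values


def mask_bounding_box(mask: Grid) -> Optional[Tuple[int, int, int, int]]:
    """Return (top, left, bottom, right), inclusive, around all nonzero
    mask pixels, or None if the mask is empty."""
    rows = [r for r, row in enumerate(mask) if any(row)]
    cols = [c for row in mask for c, value in enumerate(row) if value]
    if not rows:
        return None
    return rows[0], min(cols), rows[-1], max(cols)


def crop_segment(image: Grid, mask: Grid, margin: int = 0) -> Grid:
    """Crop the image to the mask's bounding box, padded by `margin`
    pixels and clamped to the image borders."""
    box = mask_bounding_box(mask)
    if box is None:
        return []
    top, left, bottom, right = box
    height, width = len(image), len(image[0])
    top = max(0, top - margin)
    left = max(0, left - margin)
    bottom = min(height - 1, bottom + margin)
    right = min(width - 1, right + margin)
    return [row[left:right + 1] for row in image[top:bottom + 1]]


if __name__ == "__main__":
    # Toy 4x5 "scan" and a mask marking two pixels of the target segment.
    image = [[10 * r + c for c in range(5)] for r in range(4)]
    mask = [
        [0, 0, 0, 0, 0],
        [0, 0, 1, 0, 0],
        [0, 0, 0, 1, 0],
        [0, 0, 0, 0, 0],
    ]
    print(crop_segment(image, mask))
```

The cropped region could then be passed to an OCR or handwriting-recognition step. A small margin is often useful because predicted masks tend to clip the edges of characters.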

Best

Answered by fuwiak on September 25, 2020
