TransWikia.com

Extracting information from a resume

Data Science Asked by Zouhair Elhadi on November 1, 2020

I have a machine learning project in which I need to analyze curricula
vitae (resumes), and for that I have to write a Python program. It uses
basic natural language processing techniques such as word parsing,
chunking, and a regex parser. Running the algorithm, you can capture
information such as name, email ID, address, educational
qualifications, and experience from a large number of documents in
seconds. However, I am confused about two methods: LSTM (RNN) and NER.
Which do you think is the better method?

3 Answers

LSTM is a neural network architecture used for sequence prediction, whereas NER (named entity recognition) is the name of an NLP task.

The two are not directly comparable, as one is a method and the other is a type of problem.

The relationship between them is that LSTMs can be used for sequence tagging, which in turn can be used for NER.
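
To make "sequence tagging" concrete: NER data is commonly encoded with one BIO label per token (B = beginning of an entity, I = inside, O = outside), and a tagger, whether an LSTM or another model, is trained to predict that label sequence. The sketch below only shows the encoding step; the tokens, the labels (NAME, COMPANY, DATE), and the helper name `spans_to_bio` are made up for illustration.

```python
def spans_to_bio(tokens, spans):
    """Convert (start, end, label) token spans (end exclusive) into BIO tags."""
    tags = ["O"] * len(tokens)
    for start, end, label in spans:
        tags[start] = f"B-{label}"          # first token of the entity
        for i in range(start + 1, end):
            tags[i] = f"I-{label}"          # remaining tokens of the entity
    return tags


# Hypothetical resume sentence and annotations.
tokens = ["Jane", "Doe", "worked", "at", "Acme", "Corp", "since", "2015"]
spans = [(0, 2, "NAME"), (4, 6, "COMPANY"), (7, 8, "DATE")]

tags = spans_to_bio(tokens, spans)
for token, tag in zip(tokens, tags):
    print(f"{token}\t{tag}")
```

A sequence tagger takes `tokens` as input and is trained to output `tags`; NER is then a matter of reading the entities back out of the predicted tags.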

Answered by Jakub Bartczuk on November 1, 2020

I worked on the same kind of project for months. Both work well if you have enough labeled data. For entities such as name, email ID, address, and educational qualification, regular expressions are good enough. For the more varied experience entries, you need NER or a DNN. If the amount of data is small, NER is best.
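
As a rough illustration of the regular-expression route, here is a minimal sketch that pulls email addresses and phone numbers out of resume text. The patterns are deliberately simple assumptions, not complete validators, and phone numbers are added here only as a second example of a field with a regular shape; the sample text and the function name `extract_contacts` are hypothetical.

```python
import re

# Simplified patterns: they cover common cases, not every valid address or number.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
PHONE_RE = re.compile(r"\+?\d[\d ().-]{7,}\d")


def extract_contacts(text):
    """Return every email address and phone-like string found in the text."""
    return {
        "emails": EMAIL_RE.findall(text),
        "phones": PHONE_RE.findall(text),
    }


# Made-up resume header.
sample = "Jane Doe\nEmail: jane.doe@example.com\nPhone: +1 (555) 010-9999\n"
contacts = extract_contacts(sample)
print(contacts)
```

Fields without such a regular shape, such as job descriptions, are where a trained tagger becomes more useful than hand-written patterns.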

A resume is semi-structured. You can think of it as a combination of various entities (such as name, title, company, description text, dates, and so on), so you can recombine them to expand your training dataset.
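
One way to read the idea of combining entities to grow the training set is template-based synthesis: draw entity values from pools and slot them into a line template, recording the character span of each value as its label. This is only a sketch of that interpretation; the pools, the template, and the function name `synthesize` are all hypothetical.

```python
import random

# Hypothetical pools of entity values, e.g. collected from already-labeled resumes.
POOLS = {
    "TITLE": ["Data Analyst", "Software Engineer"],
    "COMPANY": ["Acme Corp", "Globex"],
    "DATE": ["2015-2018", "Jan 2019 - Mar 2021"],
}

# A line template: parts that are keys of POOLS are slots, everything else is literal text.
TEMPLATE = ["TITLE", " at ", "COMPANY", " (", "DATE", ")"]


def synthesize(template, pools, rng):
    """Fill the template and return (text, [(start, end, label), ...])."""
    text = ""
    spans = []
    for part in template:
        if part in pools:
            value = rng.choice(pools[part])
            spans.append((len(text), len(text) + len(value), part))
            text += value
        else:
            text += part
    return text, spans


rng = random.Random(0)  # fixed seed so runs are reproducible
examples = [synthesize(TEMPLATE, POOLS, rng) for _ in range(3)]
for text, spans in examples:
    print(text, spans)
```

Each generated pair has the text-plus-character-offsets shape that NER training tools commonly accept, though synthetic lines like these are less varied than real resumes and are best mixed with real labeled data.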

Answered by Happy Boy on November 1, 2020

I would suggest using spaCy and adding your own custom labels for NER.

https://spacy.io/usage/training/#ner

Answered by Madhur Yadav on November 1, 2020
