Data Science Asked by Omar Souaidi on January 7, 2021
I’m working on a project (court-related). At a certain point, I have to extract the reason of the legal compensation. For instance, let’s take these sentences (from a court report)
Order mister X to pay EUR 5000 for compensation for unpaid wages
and
To cover damages, mister X must pay EUR 4000 to mister Y
I want to make an algorithm that is able from this sentence to extract the motive of legal compensation. For the first sentence
Order mister X to pay EUR 5000 for compensation for unpaid wages
the algorithm’s output must be “compensation for unpaid wages” or “compensation unpaid wages “.
For the second sentence, the algorithm’s output must be “cover damages”. Output can be a string or a list of string, it doesn’t matter.
As I’m not an NLP expert (but I have already worked on a project on sentiment analysis, so I know some stuff about NLP), and there are so many articles, I don’t know where to start.
I’m working on French texts, but I can get away with working on English texts.
Can you clarify how consistent the structure of these sentences will be?
That said, you may be able to apply parts of speech (POS) tagging to isolate the right class of word in each sentence. I'm sure other options also exist, but I know NLTK accomplishes this task.
Answered by phaedrus on January 7, 2021
Get help from others!
Recent Answers
Recent Questions
© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP