Assessing and Minimizing the Impact of OCR Quality on Named Entity Recognition.
Ahmed Hamdi, Axel Jean-Caurant, Nicolas Sidère, Mickaël Coustaty, Antoine Doucet
Published in: TPDL (2020)
Keyphrases
- named entity recognition
- information extraction
- named entities
- natural language processing
- maximum entropy
- semi-supervised
- conditional random fields
- relation extraction
- text summarization
- hand-coded
- classifier ensemble
- document images
- pairwise
- proper names
- sequence labeling
- optical character recognition
- question answering
- supervised learning
- maximum entropy classifier
- higher order
- prior knowledge
- learning environment