Named Entity Recognition and Correction in OCRized Corpora (Détection et correction automatique d'entités nommées dans des corpus OCRisés) [in French].
Benoît SagotKata GáborPublished in: TALN (2) (2014)
Keyphrases
- named entity recognition
- annotated corpus
- natural language processing
- information extraction
- named entities
- maximum entropy
- relation extraction
- conditional random fields
- text summarization
- reference resolution
- named entity disambiguation
- semi supervised
- automatic annotation
- linguistic features
- higher order
- text corpora
- similarity measure
- multiword
- expert systems
- object recognition