Extended Named Entities Annotation on OCRed Documents: From Corpus Constitution to Evaluation Campaign.
Olivier GalibertSophie RossetCyril GrouinPierre ZweigenbaumLudovic QuintardPublished in: LREC (2012)
Keyphrases
- named entities
- annotated corpus
- person names
- text corpus
- text documents
- news corpus
- named entity recognition
- relation extraction
- information extraction
- named entity extraction
- co occurrence
- global context
- noun phrases
- text mining
- linguistic features
- natural language processing
- semantic classes
- automatic annotation
- question answering
- genia corpus
- information retrieval
- text collections
- text corpora
- document clustering
- metadata
- unsupervised learning
- personal names
- relevant documents
- contextual features
- named entity disambiguation
- image annotation
- training corpus
- automatic summarization
- information retrieval systems
- natural language
- artificial intelligence