Tree-Structured Named Entity Recognition on OCR Data: Analysis, Processing and Results.
Marco DinarelliSophie RossetPublished in: LREC (2012)
Keyphrases
- named entity recognition
- data analysis
- information extraction
- named entities
- natural language processing
- text summarization
- semi supervised
- maximum entropy
- conditional random fields
- data mining
- relation extraction
- sequence labeling
- optical character recognition
- annotated corpus
- text mining
- classifier ensemble
- maximum entropy classifier
- document images
- computer vision
- information retrieval
- segmentation algorithm
- question answering
- active learning
- natural language
- training data
- machine learning