Comparison of named entity recognition tools for raw OCR text.
Kepa Joseba Rodriquez, Mike Bryant, Tobias Blanke, Magdalena Luszczynska
Published in: KONVENS (2012)
Keyphrases
- named entity recognition
- text summarization
- information extraction
- named entities
- natural language processing
- proper names
- named entity disambiguation
- named entity recognizer
- text recognition
- text mining
- semi-supervised
- conditional random fields
- maximum entropy
- relation extraction
- text documents
- information retrieval
- printed documents
- document analysis
- optical character recognition
- classifier ensemble
- question answering
- annotated corpus
- similarity measure
- maximum entropy classifier
- databases
- character recognition
- keywords
- document images
- natural language
- data mining
- expert systems
- knowledge representation
- probabilistic model