An Open Corpus for Named Entity Recognition in Historic Newspapers.
Clemens NeudeckerPublished in: LREC (2016)
Keyphrases
- named entity recognition
- information extraction
- named entities
- natural language processing
- text summarization
- maximum entropy
- semi supervised
- conditional random fields
- web pages
- relation extraction
- hand coded
- sequence labeling
- annotated corpus
- co occurrence
- information retrieval
- classifier ensemble
- maximum entropy classifier
- generative model
- graphical models
- text mining