Broad Twitter Corpus: A Diverse Named Entity Recognition Resource.
Leon DerczynskiKalina BontchevaIan RobertsPublished in: COLING (2016)
Keyphrases
- named entity recognition
- annotated corpus
- named entities
- information extraction
- natural language processing
- linguistic features
- reference resolution
- genia corpus
- conditional random fields
- maximum entropy
- named entity disambiguation
- semi supervised
- relation extraction
- text summarization
- sequence labeling
- pos tagging
- text mining
- real world
- proper names
- text documents
- automatic annotation
- classifier ensemble
- object recognition
- question answering
- knowledge discovery
- higher order
- maximum likelihood
- query expansion