Named Entity Tagging a Very Large Unbalanced Corpus: Training and Evaluating NE Classifiers.
Joachim BingelThomas N. HaiderPublished in: LREC (2014)
Keyphrases
- named entities
- annotated corpus
- test set
- linguistic features
- person names
- noun phrases
- training set
- co training
- named entity recognition
- information extraction
- named entity extraction
- training examples
- relation extraction
- genia corpus
- question answering
- named entity disambiguation
- co occurrence
- training data
- text documents
- text mining
- natural language processing
- unsupervised learning
- training corpus
- contextual features
- supervised learning
- automatic annotation
- proper names
- decision trees
- feature set
- conditional random fields
- weakly supervised
- coreference resolution
- pattern recognition
- feature extraction