Analysing Wikipedia and Gold-Standard Corpora for NER Training.
Joel NothmanTara MurphyJames R. CurranPublished in: EACL (2009)
Keyphrases
- gold standard
- named entities
- natural language processing
- named entity recognition
- semi automatic
- ground truth
- annotated corpus
- maximum entropy
- manual segmentation
- information extraction
- wordnet
- registration accuracy
- text mining
- text summarization
- named entity recognizer
- link structure
- chinese named entity recognition
- named entity disambiguation
- mechanical turk
- text corpus
- active learning
- domain specific
- question answering