Transforming Wikipedia into Named Entity Training Data.
Joel NothmanJames R. CurranTara MurphyPublished in: ALTA (2008)
Keyphrases
- named entities
- training data
- named entity recognition
- information extraction
- co occurrence
- relation extraction
- natural language processing
- question answering
- linguistic features
- named entity extraction
- learning algorithm
- text documents
- training set
- text mining
- decision trees
- named entity disambiguation
- data sets
- maximum entropy model
- labeled data
- weakly supervised
- noun phrases
- unsupervised learning
- supervised learning
- annotated corpus
- proper names
- global context
- data analysis
- real world
- databases