Building a Massive Corpus for Named Entity Recognition using Free Open Data Sources.
Daniel Specht MenezesPedro SavareseRuy Luiz MilidiúPublished in: CoRR (2019)
Keyphrases
- named entities
- annotated corpus
- data sources
- linguistic features
- person names
- named entity recognition
- noun phrases
- genia corpus
- news corpus
- relation extraction
- information extraction
- co occurrence
- named entity extraction
- natural language processing
- text mining
- question answering
- named entity disambiguation
- data model
- contextual features
- data sets
- unsupervised learning
- automatic annotation
- coreference resolution
- training set
- databases
- word sense
- artificial intelligence
- text documents
- data analysis
- semi supervised
- data mining
- maximum entropy model
- keywords
- database