Building a Massive Corpus for Named Entity Recognition Using Free Open Data Sources.
Daniel Specht MenezesRuy MilidiúPedro SavaresePublished in: BRACIS (2019)
Keyphrases
- named entities
- annotated corpus
- data sources
- person names
- linguistic features
- news corpus
- named entity recognition
- noun phrases
- genia corpus
- information extraction
- co occurrence
- relation extraction
- natural language processing
- named entity extraction
- question answering
- named entity disambiguation
- contextual features
- personal names
- text mining
- automatic annotation
- data analysis
- maximum entropy model
- coreference resolution
- automatic extraction
- unsupervised learning
- text documents
- multiword
- proper names
- weakly supervised
- language model