Building Multilingual Corpora for a Complex Named Entity Recognition and Classification Hierarchy using Wikipedia and DBpedia.
Diego Alves, Gaurish Thakkar, Gabriel Amaral, Tin Kuculo, Marko Tadic
Published in: CoRR (2022)
Keyphrases
- named entity recognition
- natural language processing
- information extraction
- named entities
- annotated corpus
- pattern recognition
- classifier ensemble
- text summarization
- conditional random fields
- maximum entropy
- machine learning
- classification accuracy
- relation extraction
- classification algorithm
- feature space
- decision trees
- feature extraction
- knowledge base
- databases