ParaNames 1.0: Creating an Entity Name Corpus for 400+ Languages Using Wikidata.
Jonne SäleväConstantine LignosPublished in: LREC/COLING (2024)
Keyphrases
- coreference resolution
- expressive power
- entity extraction
- language independent
- statistical machine translation
- sentence pairs
- named entity disambiguation
- entity identification
- parallel corpora
- named entities
- grammatical inference
- databases
- multi lingual
- machine translation system
- comparable corpora
- language identification
- test set
- entity ranking
- news corpus
- natural language
- knowledge base
- syntactic and semantic dependencies
- machine learning