Cross-lingual Named Entity Corpus for Slavic Languages.
Jakub PiskorskiMichal MarcinczukRoman YangarberPublished in: LREC/COLING (2024)
Keyphrases
- cross lingual
- named entities
- annotated corpus
- parallel corpus
- mono lingual
- linguistic features
- machine translation
- web news
- parallel corpora
- information extraction
- statistical machine translation
- named entity recognition
- word sense
- natural language processing
- language independent
- cross lingual information retrieval
- noun phrases
- language modeling
- co occurrence
- relation extraction
- text mining
- text classification
- cross language
- translation model
- question answering
- machine translation system
- news articles
- text documents
- information retrieval
- unsupervised learning
- document clustering
- language model
- query translation
- source language
- transfer learning
- bilingual dictionaries
- keywords
- semi supervised