Cross-lingual Named Entity Corpus for Slavic Languages.
Jakub Piskorski, Michal Marcinczuk, Roman Yangarber
Published in: CoRR (2024)
Keyphrases
- cross-lingual
- named entities
- annotated corpus
- parallel corpus
- monolingual
- machine translation
- linguistic features
- language independent
- parallel corpora
- named entity recognition
- natural language processing
- information extraction
- statistical machine translation
- word sense
- noun phrases
- web news
- cross-lingual information retrieval
- language modeling
- text mining
- relation extraction
- text classification
- translation model
- machine translation system
- cross-language
- question answering
- co-occurrence
- text documents
- query translation
- document clustering
- unsupervised learning
- transfer learning
- news articles
- semantic role labeling
- information retrieval
- target language
- source language
- language model
- semantic roles
- labeled data
- bilingual dictionaries
- semantic relations
- probabilistic model
- data analysis
- natural language
- training data