Overlap-based Vocabulary Generation Improves Cross-lingual Transfer Among Related Languages.
Vaidehi PatilPartha P. TalukdarSunita SarawagiPublished in: CoRR (2022)
Keyphrases
- cross lingual
- language independent
- transfer learning
- cross lingual information retrieval
- machine translation
- multi lingual
- language modeling
- event extraction
- european languages
- cross language
- parallel corpora
- out of vocabulary
- text classification
- document clustering
- query translation
- indian languages
- translation model
- word alignment
- parallel corpus
- statistical machine translation
- machine translation system
- language specific
- k means
- feature selection
- machine learning
- knn
- probabilistic model