Strengthening the WiC: New Polysemy Dataset in Hindi and Lack of Cross Lingual Transfer.
Haim DubossarskyFarheen DairkeePublished in: LREC/COLING (2024)
Keyphrases
- cross lingual
- machine translation
- transfer learning
- indian languages
- cross language
- language modeling
- language independent
- cross lingual information retrieval
- event extraction
- word sense disambiguation
- text classification
- query translation
- statistical machine translation
- translation model
- parallel corpora
- document clustering
- linguistic resources
- parallel corpus
- source language
- mono lingual
- comparable corpora
- machine translation system
- language model
- natural language processing
- news articles
- retrieval model
- text mining
- data analysis