Towards the Next 1000 Languages in Multilingual Machine Translation: Exploring the Synergy Between Supervised and Self-Supervised Learning.
Aditya SiddhantAnkur BapnaOrhan FiratYuan CaoMia Xu ChenIsaac CaswellXavier GarciaPublished in: CoRR (2022)
Keyphrases
- machine translation
- language independent
- cross lingual
- learning algorithm
- multilingual documents
- language resources
- grammar induction
- target language
- supervised learning
- cross language information retrieval
- language specific
- machine translation system
- statistical machine translation
- chinese english
- query translation
- cross lingual information retrieval
- natural language processing
- active learning
- multilingual information retrieval
- context free grammars
- information extraction
- linguistic resources
- comparable corpora
- feature selection
- machine learning
- domain dependent
- natural language
- digital libraries
- word sense disambiguation
- n gram