Adapting Large Multilingual Machine Translation Models to Unseen Low Resource Languages via Vocabulary Substitution and Neuron Selection.
Mohamed Abdelghaffar, Amr El Mogy, Nada Ahmed Sharaf
Published in: AMTA (2022)
Keyphrases
- machine translation
- cross lingual
- language independent
- multilingual documents
- language resources
- target language
- language specific
- machine translation system
- cross language information retrieval
- cross lingual information retrieval
- statistical machine translation
- multilingual information retrieval
- Chinese-English
- query translation
- translation model
- natural language generation
- statistical translation models
- cross language
- natural language processing
- language processing
- probabilistic model
- information extraction
- parallel corpora
- word sense disambiguation
- comparable corpora
- text classification
- natural language
- word alignment
- language modeling
- parallel corpus
- co-occurrence
- word level
- statistical model
- multilingual retrieval
- data mining