DENTRA: Denoising and Translation Pre-training for Multilingual Machine Translation.
Samta KambojSunil Kumar SahuNeha SenguptaPublished in: WMT (2022)
Keyphrases
- machine translation
- denoising
- cross lingual
- cross language information retrieval
- language resources
- language independent
- chinese english
- machine translation system
- language specific
- statistical machine translation
- multilingual documents
- parallel corpus
- target language
- cross lingual information retrieval
- query translation
- natural language processing
- natural language generation
- word alignment
- comparable corpora
- language processing
- image processing
- parallel corpora
- word sense disambiguation
- training corpus
- source language
- natural language
- multilingual information retrieval
- translation model
- cross language
- mt evaluation
- phrase based smt
- bilingual lexicon
- digital libraries
- tasks in natural language processing
- artificial intelligence
- cross language retrieval
- knowledge representation
- document images
- word level