Bilingual Dictionary-based Language Model Pretraining for Neural Machine Translation.
Yusen LinJiayong LinShuaicheng ZhangHaoying DaiPublished in: CoRR (2021)
Keyphrases
- machine translation
- language model
- cross language information retrieval
- translation model
- query translation
- chinese english
- parallel corpora
- statistical machine translation
- language modeling
- cross lingual
- n gram
- cross language retrieval
- query terms
- comparable corpora
- language independent
- information extraction
- speech recognition
- document retrieval
- query expansion
- word alignment
- language resources
- probabilistic model
- natural language processing
- retrieval model
- linguistic resources
- test collection
- word segmentation
- source language
- bilingual dictionaries
- out of vocabulary
- target language
- natural language
- parallel corpus
- bilingual lexicon
- statistical translation models
- information retrieval
- machine translation system
- pseudo relevance feedback
- context sensitive
- named entity recognition
- ad hoc information retrieval
- cross language
- machine learning
- multiword
- relevance model
- cross lingual information retrieval
- information retrieval systems