Analyzing the Mono- and Cross-Lingual Pretraining Dynamics of Multilingual Language Models.
Terra Blevins, Hila Gonen, Luke Zettlemoyer
Published in: CoRR (2022)
Keyphrases
- cross-lingual
- language modeling
- language model
- cross-lingual information retrieval
- information retrieval
- translation model
- pseudo feedback
- language independent
- retrieval model
- probabilistic model
- n-gram
- query expansion
- cross-language
- document retrieval
- test collection
- parallel corpus
- parallel corpora
- query terms
- machine translation
- context sensitive
- relevance model
- semi-supervised
- word segmentation
- pseudo relevance feedback
- query translation
- text classification
- statistical machine translation
- retrieval effectiveness
- smoothing methods
- information retrieval systems
- text retrieval