Robustification of Multilingual Language Models to Real-world Noise with Robust Contrastive Pretraining.
Asa Cooper SticklandSailik SenguptaJason KroneSaab MansourHe HePublished in: CoRR (2022)
Keyphrases
- language model
- language modeling
- probabilistic model
- n gram
- speech recognition
- cross lingual
- statistical language models
- information retrieval
- test collection
- language modelling
- query expansion
- document retrieval
- retrieval model
- ad hoc information retrieval
- context sensitive
- document length
- digital libraries
- smoothing methods
- query terms
- pseudo relevance feedback
- relevance model
- document ranking
- language independent
- cross language
- vector space model
- okapi bm
- natural language processing
- language models for information retrieval