Improving Generalization of Pre-trained Language Models via Stochastic Weight Averaging.
Peng LuIvan KobyzevMehdi RezagholizadehAhmad RashidAli GhodsiPhilippe LanglaisPublished in: CoRR (2022)
Keyphrases
- language model
- pre trained
- language modeling
- probabilistic model
- retrieval model
- document retrieval
- statistical language models
- n gram
- language modelling
- training data
- speech recognition
- test collection
- query expansion
- information retrieval
- training examples
- language models for information retrieval
- smoothing methods
- document ranking
- relevance model
- computer vision
- multi modal
- machine learning
- data sets