Small Pre-trained Language Models Can be Fine-tuned as Large Models via Over-Parameterization.
Ze-Feng Gao, Kun Zhou, Peiyu Liu, Wayne Xin Zhao, Ji-Rong Wen
Published in: ACL (1) (2023)
Keyphrases
- language model
- probabilistic model
- language modelling
- statistical language models
- language modeling
- pre-trained
- fine-tuned
- smoothing methods
- n-gram
- document retrieval
- relevance model
- speech recognition
- translation model
- neural network
- query expansion
- information retrieval
- general purpose
- face recognition
- machine learning