Login / Signup
Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method.
Shicheng Tan
Weng Lam Tam
Yuanchun Wang
Wenwen Gong
Shu Zhao
Peng Zhang
Jie Tang
Published in:
ACL (Findings) (2023)
Keyphrases
</>
language model
unsupervised learning
probabilistic model
em algorithm
statistical model
bayesian networks
error rate
test collection
relevance model
statistical language models