Login / Signup
MixKD: Towards Efficient Distillation of Large-scale Language Models.
Kevin J. Liang
Weituo Hao
Dinghan Shen
Yufan Zhou
Weizhu Chen
Changyou Chen
Lawrence Carin
Published in:
ICLR (2021)
Keyphrases
</>
language model
language modeling
probabilistic model
retrieval model
n gram
speech recognition
document retrieval
language modelling
statistical language models
query expansion
information retrieval
context sensitive
smoothing methods
test collection
document ranking