Revisiting Knowledge Distillation for Autoregressive Language Models.

Qihuang Zhong, Liang Ding, Li Shen, Juhua Liu, Bo Du, Dacheng Tao
Published in: CoRR (2024)