A Comparative Analysis of Task-Agnostic Distillation Methods for Compressing Transformer Language Models.
Takuma Udagawa
Aashka Trivedi
Michele Merler
Bishwaranjan Bhattacharjee
Published in:
CoRR (2023)
Keyphrases
language model
probabilistic model
context sensitive
language modeling
smoothing methods
information retrieval
document retrieval
search engine
test collection
statistical models