A Comparative Analysis of Task-Agnostic Distillation Methods for Compressing Transformer Language Models.

Takuma Udagawa, Aashka Trivedi, Michele Merler, Bishwaranjan Bhattacharjee
Published in: CoRR (2023)
Keyphrases
  • language model
  • probabilistic model
  • context sensitive
  • language modeling
  • smoothing methods
  • information retrieval
  • document retrieval
  • search engine
  • test collection
  • statistical models