Sign in

Model Compression and Efficient Inference for Large Language Models: A Survey.

Wenxiao WangWei ChenYicong LuoYongliu LongZhengkai LinLiye ZhangBinbin LinDeng CaiXiaofei He
Published in: CoRR (2024)
Keyphrases
  • language model
  • probabilistic model
  • efficient inference
  • relevance model
  • language modeling
  • prior knowledge
  • d objects
  • semi supervised
  • conditional random fields