Login / Signup
Model Compression and Efficient Inference for Large Language Models: A Survey.
Wenxiao Wang
Wei Chen
Yicong Luo
Yongliu Long
Zhengkai Lin
Liye Zhang
Binbin Lin
Deng Cai
Xiaofei He
Published in:
CoRR (2024)
Keyphrases
</>
language model
probabilistic model
efficient inference
relevance model
language modeling
prior knowledge
d objects
semi supervised
conditional random fields