A Survey on Efficient Inference for Large Language Models.
Zixuan ZhouXuefei NingKe HongTianyu FuJiaming XuShiyao LiYuming LouLuning WangZhihang YuanXiuhong LiShengen YanGuohao DaiXiao-Ping ZhangYuhan DongYu WangPublished in: CoRR (2024)
Keyphrases
- information extraction
- efficient inference
- language model
- conditional random fields
- information retrieval
- language modeling
- probabilistic model
- probabilistic inference
- fully connected
- n gram
- structured prediction
- query expansion
- exact inference
- markov random field
- hidden variables
- graph structure
- graphical models
- approximate inference
- markov networks
- human pose estimation
- language models for information retrieval
- smoothing methods
- linear models
- latent variables
- factor graphs
- higher order
- support vector machine
- hidden markov models