O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
Zhenyu ZhangYing ShengTianyi ZhouTianlong ChenLianmin ZhengRuisi CaiZhao SongYuandong TianChristopher RéClark W. BarrettZhangyang WangBeidi ChenPublished in: CoRR (2023)
Keyphrases
- language model
- language modeling
- n gram
- probabilistic model
- document retrieval
- test collection
- statistical language models
- language modelling
- mixture model
- language models for information retrieval
- bag of words
- query expansion
- generative model
- web search
- language modeling framework
- feature selection
- retrieval model
- query terms
- text retrieval
- context sensitive
- document length
- query processing
- ad hoc information retrieval