Sign in

O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

Zhenyu ZhangYing ShengTianyi ZhouTianlong ChenLianmin ZhengRuisi CaiZhao SongYuandong TianChristopher RéClark W. BarrettZhangyang WangBeidi Chen
Published in: CoRR (2023)
Keyphrases