Login / Signup
Junbo Deng
Publication Activity (10 Years)
Years Active: 2024-2024
Publications (10 Years): 2
Top Topics
Language Modelling
Relevance Model
Test Collection
N Gram
Top Venues
USENIX ATC
CoRR
</>
Publications
</>
Bin Gao
,
Zhuomin He
,
Puru Sharma
,
Qingxuan Kang
,
Djordje Jevdjic
,
Junbo Deng
,
Xingkun Yang
,
Zhou Yu
,
Pengfei Zuo
Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention.
USENIX ATC
(2024)
Bin Gao
,
Zhuomin He
,
Puru Sharma
,
Qingxuan Kang
,
Djordje Jevdjic
,
Junbo Deng
,
Xingkun Yang
,
Zhou Yu
,
Pengfei Zuo
AttentionStore: Cost-effective Attention Reuse across Multi-turn Conversations in Large Language Model Serving.
CoRR
(2024)