Wonbeom Lee
Publication Activity (10 Years)
Years Active: 2024-2024
Publications (10 Years): 4
Top Topics
Test Collection
Cache Management
Retrieval Model
N Gram
Top Venues
CoRR
ISCA
OSDI
Publications
Wonbeom Lee, Jungi Lee, Junghwan Seo, Jaewoong Sim
InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management.
OSDI (2024)
Jungi Lee, Wonbeom Lee, Jaewoong Sim
Tender: Accelerating Large Language Models via Tensor Decomposition and Runtime Requantization.
ISCA (2024)
Wonbeom Lee, Jungi Lee, Junghwan Seo, Jaewoong Sim
InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management.
CoRR (2024)
Jungi Lee, Wonbeom Lee, Jaewoong Sim
Tender: Accelerating Large Language Models via Tensor Decomposition and Runtime Requantization.
CoRR (2024)