​
Login / Signup
Chunan Shi
ORCID
Publication Activity (10 Years)
Years Active: 2022-2024
Publications (10 Years): 6
Top Topics
Online Learning
Computationally Expensive
Parallel Architectures
Document Length
Top Venues
CoRR
ASPLOS (3)
Proc. VLDB Endow.
ASPLOS (2)
</>
Publications
</>
Xupeng Miao
,
Gabriele Oliaro
,
Zhihao Zhang
,
Xinhao Cheng
,
Zeyu Wang
,
Zhengxin Zhang
,
Rae Ying Yee Wong
,
Alan Zhu
,
Lijie Yang
,
Xiaoxiang Shi
,
Chunan Shi
,
Zhuoming Chen
,
Daiyaan Arfeen
,
Reyna Abhyankar
,
Zhihao Jia
SpecInfer: Accelerating Large Language Model Serving with Tree-based Speculative Inference and Verification.
ASPLOS (3)
(2024)
Xupeng Miao
,
Chunan Shi
,
Jiangfei Duan
,
Xiaoli Xi
,
Dahua Lin
,
Bin Cui
,
Zhihao Jia
SpotServe: Serving Generative Large Language Models on Preemptible Instances.
ASPLOS (2)
(2024)
Bin Xiao
,
Chunan Shi
,
Xiaonan Nie
,
Fan Yang
,
Xiangwei Deng
,
Lei Su
,
Weipeng Chen
,
Bin Cui
Clover: Regressive Lightweight Speculative Decoding with Sequential Knowledge.
CoRR
(2024)
Xupeng Miao
,
Chunan Shi
,
Jiangfei Duan
,
Xiaoli Xi
,
Dahua Lin
,
Bin Cui
,
Zhihao Jia
SpotServe: Serving Generative Large Language Models on Preemptible Instances.
CoRR
(2023)
Xupeng Miao
,
Yujie Wang
,
Youhe Jiang
,
Chunan Shi
,
Xiaonan Nie
,
Hailin Zhang
,
Bin Cui
Galvatron: Efficient Transformer Training over Multiple GPUs Using Automatic Parallelism.
CoRR
(2022)
Xupeng Miao
,
Yujie Wang
,
Youhe Jiang
,
Chunan Shi
,
Xiaonan Nie
,
Hailin Zhang
,
Bin Cui
Galvatron: Efficient Transformer Training over Multiple GPUs Using Automatic Parallelism.
Proc. VLDB Endow.
16 (3) (2022)