​
Login / Signup
Xinhao Cheng
ORCID
Publication Activity (10 Years)
Years Active: 2014-2024
Publications (10 Years): 5
Top Topics
Retrieval Model
Clique Tree
Language Modelling
Tensor Space
Top Venues
CoRR
ASPLOS (3)
VTC Spring
</>
Publications
</>
Xupeng Miao
,
Gabriele Oliaro
,
Zhihao Zhang
,
Xinhao Cheng
,
Zeyu Wang
,
Zhengxin Zhang
,
Rae Ying Yee Wong
,
Alan Zhu
,
Lijie Yang
,
Xiaoxiang Shi
,
Chunan Shi
,
Zhuoming Chen
,
Daiyaan Arfeen
,
Reyna Abhyankar
,
Zhihao Jia
SpecInfer: Accelerating Large Language Model Serving with Tree-based Speculative Inference and Verification.
ASPLOS (3)
(2024)
Mengdi Wu
,
Xinhao Cheng
,
Oded Padon
,
Zhihao Jia
A Multi-Level Superoptimizer for Tensor Programs.
CoRR
(2024)
Xupeng Miao
,
Gabriele Oliaro
,
Xinhao Cheng
,
Mengdi Wu
,
Colin Unger
,
Zhihao Jia
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning.
CoRR
(2024)
Xupeng Miao
,
Gabriele Oliaro
,
Zhihao Zhang
,
Xinhao Cheng
,
Zeyu Wang
,
Rae Ying Yee Wong
,
Zhuoming Chen
,
Daiyaan Arfeen
,
Reyna Abhyankar
,
Zhihao Jia
SpecInfer: Accelerating Generative LLM Serving with Speculative Inference and Token Tree Verification.
CoRR
(2023)
Xupeng Miao
,
Gabriele Oliaro
,
Zhihao Zhang
,
Xinhao Cheng
,
Hongyi Jin
,
Tianqi Chen
,
Zhihao Jia
Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems.
CoRR
(2023)
Kang-Hao Peng
,
Kwang-Cheng Chen
,
Shao-Lun Huang
,
Shao-Chou Hung
,
Xinhao Cheng
Green Traffic Compression in Wireless Sensor Networks.
VTC Spring
(2014)