​
Hao Zhang
Publication Activity (10 Years)
Years Active: 2023-2024
Publications (10 Years): 9
Top Topics
Word Error Rate
Pseudo Feedback
Social Intelligence Design
N Gram
Top Venues
CoRR
ICLR
EdgeFM@MobiSys
MLSys
Publications
Xiaoxuan Liu, Cade Daniel, Langxiang Hu, Woosuk Kwon, Zhuohan Li, Xiangxi Mo, Alvin Cheung, Zhijie Deng, Ion Stoica, Hao Zhang
Optimizing Speculative Decoding for Serving Large Language Models Using Goodput.
CoRR (2024)
Daliang Xu, Hao Zhang, Liming Yang, Ruiqi Liu, Gang Huang, Mengwei Xu, Xuanzhe Liu
Empowering 1000 tokens/second on-device LLM prefilling with mllm-NPU.
CoRR (2024)
Wei-Lin Chiang, Lianmin Zheng, Ying Sheng, Anastasios Nikolas Angelopoulos, Tianle Li, Dacheng Li, Hao Zhang, Banghua Zhu, Michael I. Jordan, Joseph E. Gonzalez, Ion Stoica
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference.
CoRR (2024)
Yinmin Zhong, Shengyu Liu, Junda Chen, Jianbo Hu, Yibo Zhu, Xuanzhe Liu, Xin Jin, Hao Zhang
DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving.
OSDI (2024)
Longfei Yun, Yonghao Zhuang, Yao Fu, Eric P. Xing, Hao Zhang
Toward Inference-optimal Mixture-of-Expert Large Language Models.
CoRR (2024)
Jiangfei Duan, Runyu Lu, Haojie Duanmu, Xiuhong Li, Xingcheng Zhang, Dahua Lin, Ion Stoica, Hao Zhang
MuxServe: Flexible Multiplexing for Efficient Multiple LLM Serving.
CoRR (2024)
Lianmin Zheng, Wei-Lin Chiang, Ying Sheng, Tianle Li, Siyuan Zhuang, Zhanghao Wu, Yonghao Zhuang, Zhuohan Li, Zi Lin, Eric P. Xing, Joseph E. Gonzalez, Ion Stoica, Hao Zhang
LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset.
ICLR (2024)
Daliang Xu, Hao Zhang, Liming Yang, Ruiqi Liu, Mengwei Xu, Xuanzhe Liu
WiP: Efficient LLM Prefilling with Mobile NPU.
EdgeFM@MobiSys (2024)
Yonghao Zhuang, Lianmin Zheng, Zhuohan Li, Eric P. Xing, Qirong Ho, Joseph Gonzalez, Ion Stoica, Hao Zhang, Hexu Zhao
On Optimizing the Communication of Model Parallelism.
MLSys (2023)