Yinmin Zhong
Publication Activity (10 Years)
Years Active: 2023-2024
Publications (10 Years): 5
Top Topics
Translation Model
Language Models For Information Retrieval
Smoothing Methods
N Gram
Top Venues
CoRR
ASPLOS (2)
OSDI
Publications
Yinmin Zhong, Shengyu Liu, Junda Chen, Jianbo Hu, Yibo Zhu, Xuanzhe Liu, Xin Jin, Hao Zhang
DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving.
CoRR (2024)
Zhuohan Li, Lianmin Zheng, Yinmin Zhong, Vincent Liu, Ying Sheng, Xin Jin, Yanping Huang, Zhifeng Chen, Hao Zhang, Joseph E. Gonzalez, Ion Stoica
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving.
OSDI (2023)
Diandian Gu, Yihao Zhao, Yinmin Zhong, Yifan Xiong, Zhenhua Han, Peng Cheng, Fan Yang, Gang Huang, Xin Jin, Xuanzhe Liu
ElasticFlow: An Elastic Serverless Training Platform for Distributed Deep Learning.
ASPLOS (2) (2023)
Zhuohan Li, Lianmin Zheng, Yinmin Zhong, Vincent Liu, Ying Sheng, Xin Jin, Yanping Huang, Zhifeng Chen, Hao Zhang, Joseph E. Gonzalez, Ion Stoica
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving.
CoRR (2023)
Bingyang Wu, Yinmin Zhong, Zili Zhang, Gang Huang, Xuanzhe Liu, Xin Jin
Fast Distributed Inference Serving for Large Language Models.
CoRR (2023)