C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Michael Wyatt
Publication Activity (10 Years)
Years Active: 2023-2024
Publications (10 Years): 4
Top Topics
Structured Prediction
Low Latency
Proteomic Data
Text Generation
Top Venues
CoRR
</>
Publications
</>
Connor Holmes
,
Masahiro Tanaka
,
Michael Wyatt
,
Ammar Ahmad Awan
,
Jeff Rasley
,
Samyam Rajbhandari
,
Reza Yazdani Aminabadi
,
Heyang Qin
,
Arash Bakhtiari
,
Lev Kurilenko
,
Yuxiong He
DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference.
CoRR
(2024)
Haojun Xia
,
Zhen Zheng
,
Xiaoxia Wu
,
Shiyang Chen
,
Zhewei Yao
,
Stephen Youn
,
Arash Bakhtiari
,
Michael Wyatt
,
Donglin Zhuang
,
Zhongzhu Zhou
,
Olatunji Ruwase
,
Yuxiong He
,
Shuaiwen Leon Song
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design.
CoRR
(2024)
Zhewei Yao
,
Reza Yazdani Aminabadi
,
Olatunji Ruwase
,
Samyam Rajbhandari
,
Xiaoxia Wu
,
Ammar Ahmad Awan
,
Jeff Rasley
,
Minjia Zhang
,
Conglong Li
,
Connor Holmes
,
Zhongzhu Zhou
,
Michael Wyatt
,
Molly Smith
,
Lev Kurilenko
,
Heyang Qin
,
Masahiro Tanaka
,
Shuai Che
,
Shuaiwen Leon Song
,
Yuxiong He
DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales.
CoRR
(2023)
Xiaoxia Wu
,
Haojun Xia
,
Stephen Youn
,
Zhen Zheng
,
Shiyang Chen
,
Arash Bakhtiari
,
Michael Wyatt
,
Reza Yazdani Aminabadi
,
Yuxiong He
,
Olatunji Ruwase
,
Leon Song
,
Zhewei Yao
ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks.
CoRR
(2023)