Login / Signup
Stephen Youn
Publication Activity (10 Years)
Years Active: 2023-2024
Publications (10 Years): 6
Top Topics
Matrix Completion
Language Model
Low Rank Matrices
Query Expansion
Top Venues
CoRR
USENIX ATC
AAAI
</>
Publications
</>
Haojun Xia
,
Zhen Zheng
,
Xiaoxia Wu
,
Shiyang Chen
,
Zhewei Yao
,
Stephen Youn
,
Arash Bakhtiari
,
Michael Wyatt
,
Donglin Zhuang
,
Zhongzhu Zhou
,
Olatunji Ruwase
,
Yuxiong He
,
Shuaiwen Leon Song
Quant-LLM: Accelerating the Serving of Large Language Models via FP6-Centric Algorithm-System Co-Design on Modern GPUs.
USENIX ATC
(2024)
Zhewei Yao
,
Xiaoxia Wu
,
Cheng Li
,
Stephen Youn
,
Yuxiong He
Exploring Post-training Quantization in LLMs from Comprehensive Study to Low Rank Compensation.
AAAI
(2024)
Haojun Xia
,
Zhen Zheng
,
Xiaoxia Wu
,
Shiyang Chen
,
Zhewei Yao
,
Stephen Youn
,
Arash Bakhtiari
,
Michael Wyatt
,
Donglin Zhuang
,
Zhongzhu Zhou
,
Olatunji Ruwase
,
Yuxiong He
,
Shuaiwen Leon Song
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design.
CoRR
(2024)
Zhewei Yao
,
Reza Yazdani Aminabadi
,
Stephen Youn
,
Xiaoxia Wu
,
Elton Zheng
,
Yuxiong He
ZeroQuant-HERO: Hardware-Enhanced Robust Optimized Post-Training Quantization Framework for W8A8 Transformers.
CoRR
(2023)
Zhewei Yao
,
Cheng Li
,
Xiaoxia Wu
,
Stephen Youn
,
Yuxiong He
A Comprehensive Study on Post-Training Quantization for Large Language Models.
CoRR
(2023)
Xiaoxia Wu
,
Haojun Xia
,
Stephen Youn
,
Zhen Zheng
,
Shiyang Chen
,
Arash Bakhtiari
,
Michael Wyatt
,
Reza Yazdani Aminabadi
,
Yuxiong He
,
Olatunji Ruwase
,
Leon Song
,
Zhewei Yao
ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks.
CoRR
(2023)