​
Login / Signup
Jiangfei Duan
ORCID
Publication Activity (10 Years)
Years Active: 2023-2024
Publications (10 Years): 10
Top Topics
Ad Hoc Information Retrieval
Retrieval Model
N Gram
Training Phase
Top Venues
CoRR
ASPLOS (3)
ASPLOS (2)
NSDI
</>
Publications
</>
Jiangfei Duan
,
Shuo Zhang
,
Zerui Wang
,
Lijuan Jiang
,
Wenwen Qu
,
Qinghao Hu
,
Guoteng Wang
,
Qizhen Weng
,
Hang Yan
,
Xingcheng Zhang
,
Xipeng Qiu
,
Dahua Lin
,
Yonggang Wen
,
Xin Jin
,
Tianwei Zhang
,
Peng Sun
Efficient Training of Large Language Models on Distributed Infrastructures: A Survey.
CoRR
(2024)
Xupeng Miao
,
Chunan Shi
,
Jiangfei Duan
,
Xiaoli Xi
,
Dahua Lin
,
Bin Cui
,
Zhihao Jia
SpotServe: Serving Generative Large Language Models on Preemptible Instances.
ASPLOS (2)
(2024)
Qianchao Zhu
,
Jiangfei Duan
,
Chang Chen
,
Siran Liu
,
Xiuhong Li
,
Guanyu Feng
,
Xin Lv
,
Huanqi Cao
,
Xiao Chuanfu
,
Xingcheng Zhang
,
Dahua Lin
,
Chao Yang
SampleAttention: Near-Lossless Acceleration of Long Context LLM Inference with Adaptive Structured Sparse Attention.
CoRR
(2024)
Haojie Duanmu
,
Zhihang Yuan
,
Xiuhong Li
,
Jiangfei Duan
,
Xingcheng Zhang
,
Dahua Lin
SKVQ: Sliding-window Key and Value Cache Quantization for Large Language Models.
CoRR
(2024)
Jiangfei Duan
,
Runyu Lu
,
Haojie Duanmu
,
Xiuhong Li
,
Xingcheng Zhang
,
Dahua Lin
,
Ion Stoica
,
Hao Zhang
MuxServe: Flexible Multiplexing for Efficient Multiple LLM Serving.
CoRR
(2024)
Jiangfei Duan
,
Ziang Song
,
Xupeng Miao
,
Xiaoli Xi
,
Dahua Lin
,
Harry Xu
,
Minjia Zhang
,
Zhihao Jia
Parcae: Proactive, Liveput-Optimized DNN Training on Preemptible Instances.
NSDI
(2024)
Chang Chen
,
Xiuhong Li
,
Qianchao Zhu
,
Jiangfei Duan
,
Peng Sun
,
Xingcheng Zhang
,
Chao Yang
Centauri: Enabling Efficient Scheduling for Communication-Computation Overlap in Large Model Training via Communication Partitioning.
ASPLOS (3)
(2024)
Jiangfei Duan
,
Ziang Song
,
Xupeng Miao
,
Xiaoli Xi
,
Dahua Lin
,
Harry Xu
,
Minjia Zhang
,
Zhihao Jia
Parcae: Proactive, Liveput-Optimized DNN Training on Preemptible Instances.
CoRR
(2024)
Jiangfei Duan
,
Xiuhong Li
,
Ping Xu
,
Xingcheng Zhang
,
Shengen Yan
,
Yun Liang
,
Dahua Lin
Proteus: Simulating the Performance of Distributed DNN Training.
CoRR
(2023)
Xupeng Miao
,
Chunan Shi
,
Jiangfei Duan
,
Xiaoli Xi
,
Dahua Lin
,
Bin Cui
,
Zhihao Jia
SpotServe: Serving Generative Large Language Models on Preemptible Instances.
CoRR
(2023)