​
Login / Signup
Xiafei Qiu
ORCID
Publication Activity (10 Years)
Years Active: 2018-2024
Publications (10 Years): 7
Top Topics
Markov Chain Monte Carlo
Pitman Yor Process
Bayesian Model
Dynamic Graph
Top Venues
Proc. VLDB Endow.
CoRR
Proc. ACM Manag. Data
ASPLOS (4)
</>
Publications
</>
Bin Lin
,
Tao Peng
,
Chen Zhang
,
Minmin Sun
,
Lanbo Li
,
Hanyu Zhao
,
Wencong Xiao
,
Qi Xu
,
Xiafei Qiu
,
Shen Li
,
Zhigang Ji
,
Yong Li
,
Wei Lin
Infinite-LLM: Efficient LLM Service for Long Context with DistAttention and Distributed KVCache.
CoRR
(2024)
Donglin Zhuang
,
Zhen Zheng
,
Haojun Xia
,
Xiafei Qiu
,
Junjie Bai
,
Wei Lin
,
Shuaiwen Leon Song
MonoNN: Enabling a New Monolithic Optimization Space for Neural Network Inference Tasks on Modern GPU-Centric Architectures.
OSDI
(2024)
Zhen Zheng
,
Zaifeng Pan
,
Dalin Wang
,
Kai Zhu
,
Wenyi Zhao
,
Tianyou Guo
,
Xiafei Qiu
,
Minmin Sun
,
Junjie Bai
,
Feng Zhang
,
Xiaoyong Du
,
Jidong Zhai
,
Wei Lin
BladeDISC: Optimizing Dynamic Shape Machine Learning Workloads via Compiler Approach.
Proc. ACM Manag. Data
1 (3) (2023)
Haojun Xia
,
Zhen Zheng
,
Yuchao Li
,
Donglin Zhuang
,
Zhongzhu Zhou
,
Xiafei Qiu
,
Yong Li
,
Wei Lin
,
Shuaiwen Leon Song
Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity.
CoRR
(2023)
Haojun Xia
,
Zhen Zheng
,
Yuchao Li
,
Donglin Zhuang
,
Zhongzhu Zhou
,
Xiafei Qiu
,
Yong Li
,
Wei Lin
,
Shuaiwen Leon Song
Flash-LLM: Enabling Low-Cost and Highly-Efficient Large Generative Model Inference With Unstructured Sparsity.
Proc. VLDB Endow.
17 (2) (2023)
Zaifeng Pan
,
Zhen Zheng
,
Feng Zhang
,
Ruofan Wu
,
Hao Liang
,
Dalin Wang
,
Xiafei Qiu
,
Junjie Bai
,
Wei Lin
,
Xiaoyong Du
RECom: A Compiler Approach to Accelerating Recommendation Model Inference with Massive Embedding Columns.
ASPLOS (4)
(2023)
Xiafei Qiu
,
Wubin Cen
,
Zhengping Qian
,
You Peng
,
Ying Zhang
,
Xuemin Lin
,
Jingren Zhou
Real-time Constrained Cycle Detection in Large Dynamic Graphs.
Proc. VLDB Endow.
11 (12) (2018)