Login / Signup
Ruibo Fan
ORCID
Publication Activity (10 Years)
Years Active: 2023-2024
Publications (10 Years): 6
Top Topics
Graphics Hardware
Matrix Multiplication
Avoid Overfitting
General Purpose
Top Venues
CoRR
IPDPS
ASPLOS (3)
Signal Process.
</>
Publications
</>
Weile Luo
,
Ruibo Fan
,
Zeyu Li
,
Dayou Du
,
Qiang Wang
,
Xiaowen Chu
Benchmarking and Dissecting the Nvidia Hopper GPU Architecture.
IPDPS
(2024)
Ruibo Fan
,
Wei Wang
,
Xiaowen Chu
DTC-SpMM: Bridging the Gap in Accelerating General Sparse Matrix Multiplication with Tensor Cores.
ASPLOS (3)
(2024)
Ruibo Fan
,
Mingli Jing
,
Jingang Shi
,
Lan Li
,
Zizhao Wang
TVRPCA+: Low-rank and sparse decomposition based on spectral norm and structural sparsity-inducing norm.
Signal Process.
217 (2024)
Weile Luo
,
Ruibo Fan
,
Zeyu Li
,
Dayou Du
,
Qiang Wang
,
Xiaowen Chu
Benchmarking and Dissecting the Nvidia Hopper GPU Architecture.
CoRR
(2024)
Longteng Zhang
,
Xiang Liu
,
Zeyu Li
,
Xinglin Pan
,
Peijie Dong
,
Ruibo Fan
,
Rui Guo
,
Xin Wang
,
Qiong Luo
,
Shaohuai Shi
,
Xiaowen Chu
Dissecting the Runtime Performance of the Training, Fine-tuning, and Inference of Large Language Models.
CoRR
(2023)
Ruibo Fan
,
Wei Wang
,
Xiaowen Chu
Fast Sparse GPU Kernels for Accelerated Training of Graph Neural Networks.
IPDPS
(2023)