Login / Signup
Chendi Li
ORCID
Publication Activity (10 Years)
Years Active: 2021-2024
Publications (10 Years): 8
Top Topics
Main Contribution
Matrix Multiplication
Multiple Kernel
Graphics Processors
Top Venues
CoRR
ICS
ICPADS
HPCC/DSS/SmartCity/DependSys
</>
Publications
</>
Chendi Li
,
Yufan Xu
,
Sina Mahdipour Saravani
,
Ponnuswamy Sadayappan
Accelerated Auto-Tuning of GPU Kernels for Tensor Computations.
ICS
(2024)
Cunyang Wei
,
Haipeng Jia
,
Yunquan Zhang
,
Jianyu Yao
,
Chendi Li
,
Wenxuan Cao
IrGEMM: An Input-Aware Tuning Framework for Irregular GEMM on ARM and X86 CPUs.
IEEE Trans. Parallel Distributed Syst.
35 (9) (2024)
Tun Chen
,
Haipeng Jia
,
Yunquan Zhang
,
Kun Li
,
Zhihao Li
,
Xiang Zhao
,
Jianyu Yao
,
Chendi Li
OpenFFT: An Adaptive Tuning Framework for 3D FFT on ARM Multicore CPUs.
ICS
(2023)
Chendi Li
,
Haipeng Jia
,
Hang Cao
,
Jianyu Yao
,
Boqian Shi
,
Chunyang Xiang
,
Jinbo Sun
,
Pengqi Lu
,
Yunquan Zhang
AutoTSMM: An Auto-tuning Framework for Building High-Performance Tall-and-Skinny Matrix-Matrix Multiplication on CPUs.
CoRR
(2022)
Jianyu Yao
,
Boqian Shi
,
Chunyang Xiang
,
Haipeng Jia
,
Chendi Li
,
Hang Cao
,
Yunquan Zhang
IAAT: A Input-Aware Adaptive Tuning framework for Small GEMM.
CoRR
(2022)
Jianyu Yao
,
Boqian Shi
,
Chunyang Xiang
,
Haipeng Jia
,
Chendi Li
,
Hang Cao
,
Yunquan Zhang
IAAT: A Input-Aware Adaptive Tuning framework for Small GEMM.
ICPADS
(2021)
Chendi Li
,
Haipeng Jia
,
Hang Cao
,
Jianyu Yao
,
Boqian Shi
,
Chunyang Xiang
,
Jinbo Sun
,
Pengqi Lu
,
Yunquan Zhang
AutoTSMM: An Auto-tuning Framework for Building High-Performance Tall-and-Skinny Matrix-Matrix Multiplication on CPUs.
ISPA/BDCloud/SocialCom/SustainCom
(2021)
Tun Chen
,
Haipeng Jia
,
Zhihao Li
,
Chendi Li
,
Yunquan Zhang
A Transpose-free Three-dimensional FFT Algorithm on ARM CPUs.
HPCC/DSS/SmartCity/DependSys
(2021)