AutoTSMM: An Auto-tuning Framework for Building High-Performance Tall-and-Skinny Matrix-Matrix Multiplication on CPUs.
Chendi Li, Haipeng Jia, Hang Cao, Jianyu Yao, Boqian Shi, Chunyang Xiang, Jinbo Sun, Pengqi Lu, Yunquan Zhang
Published in: ISPA/BDCloud/SocialCom/SustainCom (2021)