Automatic Generation of High-Performance Convolution Kernels on ARM CPUs for Deep Learning.
Jintao MengChen ZhuangPeng ChenMohamed WahibBertil SchmidtXiao WangHaidong LanDou WuMinwen DengYanjie WeiShengzhong FengPublished in: IEEE Trans. Parallel Distributed Syst. (2022)