REDUCT: Keep it Close, Keep it Cool! : Efficient Scaling of DNN Inference on Multi-core CPUs with Near-Cache Compute.
Anant V. Nori, Rahul Bera, Shankar Balachandran, Joydeep Rakshit, Om Ji Omer, Avishaii Abuhatzera, Belliappa Kuttanna, Sreenivas Subramoney
Published in: ISCA (2021)