Mix-GEMM: An efficient HW-SW Architecture for Mixed-Precision Quantized Deep Neural Networks Inference on Edge Devices.

Enrico Reggiani, Alessandro Pappalardo, Max Doblas, Miquel Moretó, Mauro Olivieri, Osman Sabri Unsal, Adrián Cristal
Published in: HPCA (2023)