Improving blocked matrix-matrix multiplication routine by utilizing AVX-512 instructions on Intel Knights Landing and Xeon Scalable processors.

Published in: Clust. Comput. (2023)

Keyphrases