Login / Signup

Stream-K: Work-Centric Parallel Decomposition for Dense Matrix-Matrix Multiplication on the GPU.

Muhammad OsamaDuane MerrillCris CeckaMichael GarlandJohn D. Owens
Published in: PPoPP (2023)
Keyphrases