Efficient Multi-GPU Shared Memory via Automatic Optimization of Fine-Grained Transfers.

Published in: ISCA (2021)

Keyphrases