SSD-SGD: Communication Sparsification for Distributed Deep Learning Training.

Published in: ACM Trans. Archit. Code Optim. (2023)

Keyphrases