SUARA: A scalable universal allreduce communication algorithm for acceleration of parallel deep learning applications.
Emin NuriyevRavi Reddy ManumachuSamar AseeriMahendra K. VermaAlexey L. LastovetskyPublished in: J. Parallel Distributed Comput. (2024)