Breaking (Global) Barriers in Parallel Stochastic Optimization With Wait-Avoiding Group Averaging.
Shigang Li, Tal Ben-Nun, Giorgi Nadiradze, Salvatore Di Girolamo, Nikoli Dryden, Dan Alistarh, Torsten Hoefler
Published in: IEEE Trans. Parallel Distributed Syst. (2021)