Login / Signup

Parallel Restarted SGD with Faster Convergence and Less Communication: Demystifying Why Model Averaging Works for Deep Learning.

Hao YuSen YangShenghuo Zhu
Published in: AAAI (2019)
Keyphrases