Fast Distributed Training of Deep Neural Networks: Dynamic Communication Thresholding for Model and Data Parallelism.
Vipul Gupta, Dhruv Choudhary, Ping Tak Peter Tang, Xiaohan Wei, Xing Wang, Yuzhen Huang, Arun Kejariwal, Kannan Ramchandran, Michael W. Mahoney
Published in: CoRR (2020)