A Network-Centric Hardware/Algorithm Co-Design to Accelerate Distributed Training of Deep Neural Networks.
Youjie LiJongse ParkMohammad AlianYifan YuanZheng QuPeitian PanRen WangAlexander G. SchwingHadi EsmaeilzadehNam Sung KimPublished in: MICRO (2018)