Fold3D: Rethinking and Parallelizing Computational and Communicational Tasks in the Training of Large DNN Models.
Fanxin LiShixiong ZhaoYuhao QingXusheng ChenXiuxian GuanSen WangGong ZhangHeming CuiPublished in: IEEE Trans. Parallel Distributed Syst. (2023)