Towards Scalable Distributed Training of Deep Learning on Public Cloud Clusters.
Shaohuai ShiXianhao ZhouShutao SongXingyao WangZilin ZhuXue HuangXinan JiangFeihu ZhouZhenyu GuoLiqiang XieRui LanXianbin OuyangYan ZhangJieqian WeiJing GongWeiliang LinPing GaoPeng MengXiaomin XuChenyang GuoBo YangZhibo ChenYongjian WuXiaowen ChuPublished in: MLSys (2021)
Keyphrases
- deep learning
- scalable distributed
- deep architectures
- restricted boltzmann machine
- unsupervised learning
- unsupervised feature learning
- machine learning
- clustering algorithm
- deep belief networks
- data points
- training samples
- weakly supervised
- learning algorithm
- mental models
- supervised learning
- training set
- file system
- training examples
- viewpoint
- similarity measure