Layer-wise Adaptive Gradient Sparsification for Distributed Deep Learning with Convergence Guarantees.
Shaohuai ShiZhenheng TangQiang WangKaiyong ZhaoXiaowen ChuPublished in: CoRR (2019)
Keyphrases
- deep learning
- restricted boltzmann machine
- unsupervised learning
- machine learning
- unsupervised feature learning
- weakly supervised
- pairwise
- deep architectures
- mental models
- pattern recognition
- graph cuts
- segmentation method
- generative model
- supervised learning
- object recognition
- reinforcement learning
- training data
- decision trees
- information retrieval