Asynchronous SGD with stale gradient dynamic adjustment for deep learning training.
Tao TanHong XieYunni XiaXiaoyu ShiMingsheng ShangPublished in: Inf. Sci. (2024)
Keyphrases
- deep learning
- deep architectures
- restricted boltzmann machine
- unsupervised learning
- unsupervised feature learning
- stochastic gradient descent
- online learning
- deep belief networks
- training set
- machine learning
- bayesian networks
- supervised learning
- image processing
- training samples
- weakly supervised
- decision making
- computer vision
- data sets