Optimizing Multi-GPU Parallelization Strategies for Deep Learning Training.
Saptadeep Pal, Eiman Ebrahimi, Arslan Zulfiqar, Yaosheng Fu, Victor Zhang, Szymon Migacz, David W. Nellans, Puneet Gupta
Published in: CoRR (2019)
Keyphrases
- deep learning
- deep architectures
- restricted boltzmann machine
- unsupervised learning
- parallel processing
- unsupervised feature learning
- machine learning
- weakly supervised
- deep belief networks
- supervised learning
- mental models
- training samples
- higher order
- viewpoint
- training set
- computer vision
- input image
- text mining
- 3D objects
- pairwise