Login / Signup
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima.
Nitish Shirish Keskar
Dheevatsa Mudigere
Jorge Nocedal
Mikhail Smelyanskiy
Ping Tak Peter Tang
Published in:
ICLR (2017)
Keyphrases
</>
deep learning
deep architectures
restricted boltzmann machine
unsupervised learning
machine learning
deep belief networks
unsupervised feature learning
training set
online learning
mental models
computer vision
decision making
face recognition
feature extraction
multi class