Towards Theoretically Understanding Why SGD Generalizes Better Than Adam in Deep Learning.
Pan Zhou
Jiashi Feng
Chao Ma
Caiming Xiong
Steven Chu-Hong Hoi
Weinan E
Published in:
NeurIPS (2020)
Keyphrases
deep learning
unsupervised learning
unsupervised feature learning
machine learning
restricted Boltzmann machine
information retrieval
viewpoint
learning strategies
mental models
clustering algorithm
training set
pairwise