Login / Signup
The Effect of Network Width on Stochastic Gradient Descent and Generalization: an Empirical Study.
Daniel S. Park
Jascha Sohl-Dickstein
Quoc V. Le
Samuel L. Smith
Published in:
ICML (2019)
Keyphrases
</>
stochastic gradient descent
least squares
step size
training data
lower bound
loss function