Login / Signup
The Effect of Network Width on Stochastic Gradient Descent and Generalization: an Empirical Study.
Daniel S. Park
Jascha Sohl-Dickstein
Quoc V. Le
Samuel L. Smith
Published in:
CoRR (2019)
Keyphrases
</>
stochastic gradient descent
least squares
matrix factorization
optimal solution
step size
random forests