On the Computational Inefficiency of Large Batch Sizes for Stochastic Gradient Descent.
Noah Golmant
Nikita Vemuri
Zhewei Yao
Vladimir Feinberg
Amir Gholami
Kai Rothauge
Michael W. Mahoney
Joseph Gonzalez
Published in:
CoRR (2018)
Keyphrases
stochastic gradient descent
online algorithms
least squares
matrix factorization
random forests
loss function
step size
support vector
support vector machine
regularization parameter
weight vector
learning algorithm