Parallelizing Stochastic Gradient Descent for Least Squares Regression: Mini-batching, Averaging, and Model Misspecification.
Prateek Jain
Sham M. Kakade
Rahul Kidambi
Praneeth Netrapalli
Aaron Sidford
Published in:
J. Mach. Learn. Res. (2017)
Keyphrases
stochastic gradient descent
loss function
least squares
matrix factorization
step size
random forests
multiple kernel learning
support vector machine
regularization parameter
support vector
active learning
small number
collaborative filtering
weight vector
importance sampling
feature space