Iterate averaging as regularization for stochastic gradient descent.
Gergely NeuLorenzo RosascoPublished in: CoRR (2018)
Keyphrases
- stochastic gradient descent
- least squares
- loss function
- matrix factorization
- step size
- regularization parameter
- random forests
- early stopping
- support vector machine
- weight vector
- multiple kernel learning
- online algorithms
- importance sampling
- cross validation
- collaborative filtering
- pairwise
- decision trees
- support vector