Optimizing Stochastic Gradient Descent Using the Angle Between Gradients.
Chongya Song, Alexander Perez-Pons, Kang K. Yen
Published in: IEEE BigData (2020)
Keyphrases
- stochastic gradient descent
- least squares
- matrix factorization
- loss function
- step size
- random forests
- multiple kernel learning
- online algorithms
- support vector machine
- regularization parameter
- collaborative filtering
- weight vector
- importance sampling
- feature selection
- feature space
- support vector
- text categorization
- convergence speed
- feature extraction