Periodic Stochastic Gradient Descent with Momentum for Decentralized Training.
Hongchang Gao, Heng Huang
Published in: CoRR (2020)
Keyphrases
- stochastic gradient descent
- least squares
- early stopping
- matrix factorization
- loss function
- step size
- random forests
- learning rate
- regularization parameter
- support vector machine
- weight vector
- convergence rate
- multiple kernel learning
- online algorithms
- kernel methods
- pairwise
- linear SVM
- objective function
- importance sampling
- training set
- machine learning methods
- Monte Carlo
- text categorization
- linear combination