Leader Stochastic Gradient Descent for Distributed Training of Deep Learning Models.
Yunfei Teng, Wenbo Gao, François Chalus, Anna Choromanska, Donald Goldfarb, Adrian Weller
Published in: NeurIPS (2019)
Keyphrases
- stochastic gradient descent
- learning models
- loss function
- multiple kernel learning
- pairwise
- least squares
- support vector
- matrix factorization
- machine learning
- learning algorithm
- semi-supervised learning
- machine learning algorithms
- learning tasks
- random forests
- step size
- training set
- data sets
- online algorithms
- weight vector
- feature selection
- classification models
- learning problems
- similarity measure
- kernel methods
- conditional random fields
- support vector machine