Parle: parallelizing stochastic gradient descent.
Pratik ChaudhariCarlo BaldassiRiccardo ZecchinaStefano SoattoAmeet TalwalkarPublished in: CoRR (2017)
Keyphrases
- stochastic gradient descent
- least squares
- loss function
- matrix factorization
- step size
- random forests
- support vector machine
- weight vector
- multiple kernel learning
- online algorithms
- regularization parameter
- feature extraction
- importance sampling
- machine learning
- random forest
- logistic regression
- collaborative filtering
- training set