CD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation.
Enda YuDezun DongYemao XuShuo OuyangXiangke LiaoPublished in: ICPP (2021)
Keyphrases
- stochastic gradient descent
- least squares
- loss function
- matrix factorization
- stochastic gradient
- step size
- random forests
- regularization parameter
- weight vector
- alternating least squares
- importance sampling
- image compression
- online algorithms
- image processing
- convergence speed
- linear combination
- multiple kernel learning
- model selection
- support vector machine
- machine learning