Login / Signup
The Anisotropic Noise in Stochastic Gradient Descent: Its Behavior of Escaping from Sharp Minima and Regularization Effects.
Zhanxing Zhu
Jingfeng Wu
Bing Yu
Lei Wu
Jinwen Ma
Published in:
ICML (2019)
Keyphrases
</>
stochastic gradient descent
least squares
loss function
matrix factorization
step size
early stopping
regularization parameter
random forests
support vector machine
missing data
multiple kernel learning
importance sampling
noise level
pairwise
noise reduction
multiscale