Special Properties of Gradient Descent with Large Learning Rates.
Amirkeivan Mohtashami
Martin Jaggi
Sebastian U. Stich
Published in:
ICML (2023)
Keyphrases
special properties
learning rate
error function
update rule
convergence rate
learning algorithm
gaussian kernels
cost function
convergence speed
covering numbers
uniform convergence
weight vector
loss function
convergence theorem