Login / Signup
SGD with AdaGrad Stepsizes: Full Adaptivity with High Probability to Unknown Parameters, Unbounded Gradients and Affine Variance.
Amit Attia
Tomer Koren
Published in:
ICML (2023)
Keyphrases
</>
wide range
low variance
conditional probabilities
standard deviation
approximate dynamic programming
data sets
machine learning
image sequences
dynamic programming
image registration
linear combination
high precision
covariance matrix
geometric transformations
variance reduction