Adaptive Gradient Methods Converge Faster with Over-Parameterization (and you can do a line-search).
Sharan Vaswani
Frederik Kunstner
Issam H. Laradji
Si Yi Meng
Mark Schmidt
Simon Lacoste-Julien
Published in:
CoRR (2020)
Keyphrases
density estimation
multiscale
lower bound
pairwise
supervised learning
parameter estimation
optimization algorithm
line search