Login / Signup
Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective.
Wu Lin
Felix Dangel
Runa Eschenhagen
Juhan Bae
Richard E. Turner
Alireza Makhzani
Published in:
CoRR (2024)
Keyphrases
</>
square root
machine learning methods
machine learning
computer vision