Login / Signup

Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective.

Wu LinFelix DangelRuna EschenhagenJuhan BaeRichard E. TurnerAlireza Makhzani
Published in: CoRR (2024)
Keyphrases
  • square root
  • machine learning methods
  • machine learning
  • computer vision