Gradient Descent on Neural Networks Typically Occurs at the Edge of Stability.
Jeremy M. Cohen, Simran Kaur, Yuanzhi Li, J. Zico Kolter, Ameet Talwalkar
Published in: ICLR (2021)
Keyphrases
- neural network
- learning rules
- cost function
- neural network model
- neural nets
- artificial neural networks
- back propagation
- objective function
- lyapunov function
- loss function
- stability analysis
- multi layer
- multilayer perceptron
- lyapunov theory
- network architecture
- learning algorithm
- recurrent neural networks
- feed forward
- data sets
- sufficient conditions