(S)GD over Diagonal Linear Networks: Implicit Regularisation, Large Stepsizes and Edge of Stability.

Mathieu Even, Scott Pesme, Suriya Gunasekar, Nicolas Flammarion
Published in: CoRR (2023)
Keyphrases