Login / Signup

Depth Dependence of μP Learning Rates in ReLU MLPs.

Samy JelassiBoris HaninZiwei JiSashank J. ReddiSrinadh BhojanapalliSanjiv Kumar
Published in: CoRR (2023)
Keyphrases