Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales.

Yiqun Yao, Yequan Wang
Published in: CoRR (2023)
Keyphrases