Catapults in SGD: spikes in the training loss and their impact on generalization through feature learning.

Libin Zhu, Chaoyue Liu, Adityanarayanan Radhakrishnan, Mikhail Belkin
Published in: CoRR (2023)
Keyphrases