Catapults in SGD: spikes in the training loss and their impact on generalization through feature learning.

Libin Zhu, Chaoyue Liu, Adityanarayanan Radhakrishnan, Mikhail Belkin
Published in: CoRR (2023)
Keyphrases