Grokking as the Transition from Lazy to Rich Training Dynamics.
Tanishq KumarBlake BordelonSamuel J. GershmanCengiz PehlevanPublished in: CoRR (2023)
Keyphrases
- recurrent networks
- dynamical systems
- training examples
- training set
- computer vision
- training process
- training dataset
- high level
- initial conditions
- supervised learning
- database
- dynamic model
- training phase
- databases
- information systems
- structured prediction
- training algorithm
- small number
- image sequences
- decision trees