Acceleration via Fractal Learning Rate Schedules.
Naman AgarwalSurbhi GoelCyril ZhangPublished in: ICML (2021)
Keyphrases
- learning rate
- convergence rate
- learning algorithm
- fractal dimension
- scheduling problem
- image compression
- convergence speed
- error function
- adaptive learning rate
- rapid convergence
- multilayer neural networks
- training algorithm
- weight vector
- convergence theorem
- hidden layer
- activation function
- delta bar delta
- natural gradient
- reinforcement learning