Learning Rate Schedules in the Presence of Distribution Shift.
Matthew Fahrbach, Adel Javanmard, Vahab Mirrokni, Pratik Worah
Published in: CoRR (2023)
Keyphrases
- learning rate
- convergence rate
- learning algorithm
- hidden layer
- rapid convergence
- error function
- convergence speed
- multilayer neural networks
- training algorithm
- convergence theorem
- delta bar delta
- scheduling problem
- weight vector
- uniform convergence
- adaptive learning rate
- genetic algorithm
- dynamic programming
- NP-hard
- evolutionary algorithm
- data mining