Maximal Initial Learning Rates in Deep ReLU Networks.

Gaurav Iyer, Boris Hanin, David Rolnick
Published in: CoRR (2022)
Keyphrases
  • learning rate
  • learning algorithm
  • convergence rate
  • gaussian kernels
  • error function
  • convergence speed
  • hidden layer
  • uniform convergence
  • covering numbers
  • activation function
  • distance measure