Login / Signup

Cyclic and Randomized Stepsizes Invoke Heavier Tails in SGD.

Mert GürbüzbalabanYuanhan HuUmut SimsekliLingjiong Zhu
Published in: CoRR (2023)
Keyphrases
  • approximate dynamic programming
  • stochastic gradient descent
  • long tail
  • decision forest
  • machine learning
  • heavy tails
  • multi class
  • online learning
  • randomized algorithms