AdaPlus: Integrating Nesterov Momentum and Precise Stepsize Adjustment on AdamW Basis.

Lei Guan
Published in: CoRR (2023)
Keyphrases
  • step size
  • convergence rate
  • cost function
  • approximate dynamic programming
  • quasi-Newton
  • learning rate
  • temporal difference
  • convex-concave
  • semidefinite programming
  • faster convergence