Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves Generalization.

Xingxuan Zhang, Renzhe Xu, Han Yu, Hao Zou, Peng Cui
Published in: CoRR (2023)
Keyphrases
  • objective function
  • convex functions
  • higher order
  • edge detection
  • decision trees
  • learning machines
  • horn clauses
  • gradient information
  • norm minimization
  • multiscale
  • support vector