Parameter Averaging for SGD Stabilizes the Implicit Bias towards Flat Regions.

Atsushi Nitanda, Ryuhei Kikuchi, Shugo Maeda
Published in: CoRR (2023)
Keyphrases
  • parameter space
  • parameter values
  • stochastic gradient descent
  • input image
  • input parameters
  • machine learning
  • step size
  • salient features
  • homogeneous regions
  • update rule