Login / Signup
Parameter Averaging for SGD Stabilizes the Implicit Bias towards Flat Regions.
Atsushi Nitanda
Ryuhei Kikuchi
Shugo Maeda
Published in:
CoRR (2023)
Keyphrases
</>
parameter space
parameter values
stochastic gradient descent
input image
input parameters
machine learning
step size
salient features
homogeneous regions
update rule