AdaSAM: Boosting sharpness-aware minimization with adaptive learning rate and momentum for training deep neural networks.
Hao SunLi ShenQihuang ZhongLiang DingShixiang ChenJingwei SunJing LiGuangzhong SunDacheng TaoPublished in: Neural Networks (2024)