AdaShift: Decorrelation and Convergence of Adaptive Learning Rate Methods.

Zhiming ZhouQingru ZhangGuansong LuHongwei WangWeinan ZhangYong Yu
Published in: ICLR (Poster) (2019)