Login / Signup

Gradient descent optimizes over-parameterized deep ReLU networks.

Difan ZouYuan CaoDongruo ZhouQuanquan Gu
Published in: Mach. Learn. (2020)
Keyphrases