SGD Learns Over-parameterized Networks that Provably Generalize on Linearly Separable Data.

Alon Brutzkus, Amir Globerson, Eran Malach, Shai Shalev-Shwartz
Published in: ICLR (Poster) (2018)