Login / Signup
SGD Learns Over-parameterized Networks that Provably Generalize on Linearly Separable Data.
Alon Brutzkus
Amir Globerson
Eran Malach
Shai Shalev-Shwartz
Published in:
CoRR (2017)
Keyphrases
</>
data sets
data points
training data
data distribution
nearest neighbor
linearly separable
face recognition
pattern recognition
data analysis
covariance matrix
pattern classification