Login / Signup
Training shallow ReLU networks on noisy data using hinge loss: when do we overfit and is it benign?
Erin George
Michael Murray
William Swartworth
Deanna Needell
Published in:
NeurIPS (2023)
Keyphrases
</>
noisy data
hinge loss
loss function
high dimensional
soft margin
missing data
risk minimization
missing values
training algorithm
training samples
input data
supervised learning
training data
convex optimization
training process
training set
training examples
model selection
binary classification
potential functions