Deep linear networks can benignly overfit when shallow ones do.
Niladri S. Chatterji
Philip M. Long
Published in:
J. Mach. Learn. Res. (2023)
Keyphrases
neural network
social networks
multiscale
question answering
network design
information extraction
closed form
complex networks
wall street journal
bayesian networks
natural language processing
network model
piecewise linear
linear systems
probabilistic networks
network size