Generalization Bounds of Stochastic Gradient Descent for Wide and Deep Neural Networks.
Yuan Cao, Quanquan Gu
Published in: CoRR (2019)
Keyphrases
- stochastic gradient descent
- neural network
- least squares
- loss function
- matrix factorization
- data dependent
- pattern recognition
- random forests
- artificial neural networks
- generalization ability
- weight vector
- multiple kernel learning
- back propagation
- support vector machine
- VC dimension
- BP neural network
- learning problems
- feature set
- upper bound
- active learning
- feature extraction