Momentum Q-learning with Finite-Sample Convergence Guarantee.
Bowen WengHuaqing XiongLin ZhaoYingbin LiangWei ZhangPublished in: CoRR (2020)
Keyphrases
- finite sample
- learning rate
- uniform convergence
- sample size
- convergence rate
- learning algorithm
- statistical learning theory
- error bounds
- nearest neighbor
- convergence speed
- gaussian kernels
- reinforcement learning
- parzen window
- generalization error
- machine learning
- generalization bounds
- state space
- density estimation
- neural network
- machine learning algorithms
- theoretical analysis
- model selection