Zap Q-Learning With Nonlinear Function Approximation.
Shuhang ChenAdithya M. DevrajAna BusicSean P. MeynPublished in: CoRR (2019)
Keyphrases
- function approximation
- reinforcement learning
- tile coding
- temporal difference learning algorithms
- state action space
- mountain car
- learning tasks
- radial basis function
- model free
- temporal difference learning
- temporal difference
- function approximators
- reinforcement learning algorithms
- td learning
- temporal difference methods
- learning algorithm
- machine learning
- neural network
- feature selection