Reinforcement learning via kernel temporal difference.
Jihye Bae, Pratik Chhatbar, Joseph T. Francis, Justin C. Sanchez, José C. Príncipe
Published in: EMBC (2011)
Keyphrases
- temporal difference
- reinforcement learning
- function approximation
- TD learning
- model free
- evaluation function
- reinforcement learning algorithms
- temporal difference learning
- Monte Carlo
- actor critic
- kernel function
- policy evaluation
- action selection
- temporal difference methods
- step size
- policy iteration
- state space
- function approximators
- supervised learning
- feature space
- support vector
- TD methods
- kernel methods
- neural network
- optimal control
- reinforcement learning methods
- Markov decision processes
- machine learning