Reinforcement learning via kernel temporal difference.

Jihye Bae, Pratik Chhatbar, Joseph T. Francis, Justin C. Sanchez, José C. Príncipe

Published in: EMBC (2011)

Keyphrases

temporal difference
reinforcement learning
function approximation
td learning
model free
evaluation function
reinforcement learning algorithms
temporal difference learning
monte carlo
actor critic
kernel function
policy evaluation
action selection
temporal difference methods
step size
policy iteration
state space
function approximators
supervised learning
feature space
support vector
td methods
kernel methods
neural network
optimal control
reinforcement learning methods
markov decision processes
machine learning