Convergent Fitted Value Iteration with Linear Function Approximation.

Daniel J. Lizotte

Published in: NIPS (2011)

Keyphrases

function approximation
temporal difference learning algorithms
reinforcement learning
function approximators
model free
temporal difference learning
radial basis function
Markov decision processes
learning tasks
temporal difference
optimal policy
policy iteration
state space
temporal difference methods
TD learning
fixed point
reinforcement learning algorithms
Markov decision process
linear combination
dynamic programming