Dynamics of Temporal Difference Learning.

Andreas Wendemuth

Published in: IJCAI (2007)

Keyphrases

temporal difference learning
function approximation
fixed point
evaluation function
reinforcement learning
game playing
approximate value iteration
temporal difference
dynamical systems
reinforcement learning algorithms
markov decision process
neural network
sufficient conditions
monte carlo