On the mean-square rate of convergence of temporal-difference learning algorithms.

Vladislav B. Tadic

Published in: ACC (2002)

Keyphrases

temporal difference learning algorithms
function approximation
sufficient conditions
approximation error
convergence rate
number of iterations required
temporal difference learning
asymptotic properties
reinforcement learning
active learning
markov chain
semi supervised learning