Login / Signup
On the mean-square rate of convergence of temporal-difference learning algorithms.
Vladislav B. Tadic
Published in:
ACC (2002)
Keyphrases
</>
temporal difference learning algorithms
function approximation
sufficient conditions
approximation error
convergence rate
number of iterations required
temporal difference learning
asymptotic properties
reinforcement learning
active learning
markov chain
semi supervised learning