New Versions of Gradient Temporal-Difference Learning.

Donghwan Lee Han-Dong Lim Jihoon Park Okyong Choi

Published in: IEEE Trans. Autom. Control. (2023)

Keyphrases

temporal difference learning
function approximation
fixed point
evaluation function
game playing
reinforcement learning
approximate value iteration
temporal difference
function approximators
monte carlo
reinforcement learning algorithms
markov decision process
learning algorithm
sufficient conditions
gaussian process