Versions of Gradient Temporal Difference Learning.

Donghwan Lee Han-Dong Lim Jihoon Park Okyong Choi

Published in: CoRR (2021)

Keyphrases

temporal difference learning
function approximation
fixed point
evaluation function
game playing
reinforcement learning
approximate value iteration
temporal difference
markov decision process
reinforcement learning algorithms
neural network
machine learning
sufficient conditions
monte carlo
markov decision processes