Backstepping Temporal Difference Learning.

Han-Dong Lim Donghwan Lee

Published in: CoRR (2023)

Keyphrases

temporal difference learning
fixed point
function approximation
control strategy
evaluation function
reinforcement learning
game playing
temporal difference
approximate value iteration
markov decision process
monte carlo
reinforcement learning algorithms
neural network
computer games
learning experience
policy iteration
machine learning