Backstepping Temporal Difference Learning.

Han-Dong Lim Donghwan Lee

Published in: ICLR (2023)

Keyphrases

temporal difference learning
fixed point
function approximation
evaluation function
reinforcement learning
game playing
control strategy
temporal difference
markov decision process
approximate value iteration
reinforcement learning algorithms
sufficient conditions
monte carlo
belief propagation