Login / Signup
Backstepping Temporal Difference Learning.
Han-Dong Lim
Donghwan Lee
Published in:
ICLR (2023)
Keyphrases
</>
temporal difference learning
fixed point
function approximation
evaluation function
reinforcement learning
game playing
control strategy
temporal difference
markov decision process
approximate value iteration
reinforcement learning algorithms
sufficient conditions
monte carlo
belief propagation