n-Step Temporal Difference Learning with Optimal n

Lakshmi Mandal, Shalabh Bhatnagar

Published in: CoRR (2023)

Keyphrases

temporal difference learning
fixed point
function approximation
reinforcement learning
dynamic programming
game playing
approximate value iteration
optimal solution
worst case
temporal difference
sufficient conditions
optimal control
Markov decision process
reinforcement learning algorithms