Login / Signup
n-Step Temporal Difference Learning with Optimal n.
Lakshmi Mandal
Shalabh Bhatnagar
Published in:
CoRR (2023)
Keyphrases
</>
temporal difference learning
fixed point
function approximation
reinforcement learning
dynamic programming
game playing
approximate value iteration
optimal solution
worst case
temporal difference
sufficient conditions
optimal control
markov decision process
reinforcement learning algorithms