Characterizing the Exact Behaviors of Temporal Difference Learning Algorithms Using Markov Jump Linear System Theory.

Bin Hu, Usman Ahmed Syed

Published in: CoRR (2019)

Keyphrases

temporal difference learning algorithms
function approximation
Markov chain
temporal difference learning
reinforcement learning
training set
dynamic programming