Login / Signup
Characterizing the Exact Behaviors of Temporal Difference Learning Algorithms Using Markov Jump Linear System Theory.
Bin Hu
Usman Ahmed Syed
Published in:
CoRR (2019)
Keyphrases
</>
temporal difference learning algorithms
function approximation
markov chain
temporal difference learning
reinforcement learning
training set
dynamic programming