Differential Temporal Difference Learning.

Adithya M. Devraj Ioannis Kontoyiannis Sean P. Meyn

Published in: IEEE Trans. Autom. Control. (2021)

Keyphrases

temporal difference learning
fixed point
function approximation
game playing
evaluation function
approximate value iteration
reinforcement learning
temporal difference
reinforcement learning algorithms
monte carlo
policy iteration
function approximators
machine learning
markov decision process
learning algorithm
learning outcomes
machine learning algorithms
least squares
cost function