Should All Temporal Difference Learning Use Emphasis?

Xiang Gu, Sina Ghiassian, Richard S. Sutton

Published in: CoRR (2019)

Keyphrases

temporal difference learning
function approximation
evaluation function
fixed point
reinforcement learning
game playing
approximate value iteration
temporal difference
reinforcement learning algorithms
Markov decision process
Monte Carlo
linear combination
sufficient conditions
least squares
function approximators
state space
multi-agent