Login / Signup
Natural Temporal Difference Learning.
William Dabney
Philip S. Thomas
Published in:
AAAI (2014)
Keyphrases
</>
temporal difference learning
function approximation
fixed point
evaluation function
reinforcement learning
game playing
temporal difference
approximate value iteration
markov decision process
reinforcement learning algorithms
model free
function approximators