Login / Signup
True Online Temporal-Difference Learning.
Harm van Seijen
Ashique Rupam Mahmood
Patrick M. Pilarski
Marlos C. Machado
Richard S. Sutton
Published in:
J. Mach. Learn. Res. (2016)
Keyphrases
</>
temporal difference learning
function approximation
reinforcement learning
fixed point
evaluation function
approximate value iteration
game playing
temporal difference
markov decision process
machine learning
search space