Temporal Difference Learning and TD-Gammon.

Published in: Commun. ACM (1995)

Keyphrases

temporal difference learning
temporal difference
function approximation
fixed point
evaluation function
reinforcement learning
game playing
approximate value iteration
reinforcement learning algorithms
Monte Carlo
Markov decision process
function approximators
model free
state space
dynamic programming
Bayesian networks
machine learning