Improving Temporal Difference Learning Performance in Backgammon Variants.

Nikolaos Papahristou Ioannis Refanidis

Published in: ACG (2011)

Keyphrases

temporal difference learning
function approximation
fixed point
evaluation function
reinforcement learning
game playing
approximate value iteration
temporal difference
reinforcement learning algorithms
markov decision process
policy iteration
neural network
monte carlo
learning outcomes
linear combination
probability distribution
image segmentation