TD (mu): A Modification of TD (lambda) That Enables a Program to Learn Weights for Good Play Even if It Observes Only Bad Play.

Published in: JCIS (2002)

Keyphrases

temporal difference
temporal difference learning
reinforcement learning
linear combination
game playing
learning algorithm
real world
machine learning
artificial neural networks
board game