Login / Signup
TD (mu): A Modificaiton of TD (lambda) That Enables a Program to Learn Weights for Good Play Even if It Observes Only Bad Play.
Donald F. Beal
Published in:
JCIS (2002)
Keyphrases
</>
temporal difference
temporal difference learning
reinforcement learning
linear combination
game playing
learning algorithm
real world
machine learning
artificial neural networks
board game