An Improved On-line Algorithm for Learning Linear Evaluation Functions.

Published in: COLT (2000)

Keyphrases

learning algorithm
evaluation function
TD learning
learning process
reinforcement learning
alpha-beta
objective function
optimal solution
search algorithm
temporal difference learning
NP-hard
dynamic programming
model free