An Improved On-line Algorithm for Learning Linear Evaluation Functions.
Peter Auer
Published in:
COLT (2000)
Keyphrases
learning algorithm
evaluation function
TD learning
learning process
reinforcement learning
alpha-beta
objective function
optimal solution
search algorithm
temporal difference learning
NP-hard
dynamic programming
model-free