Login / Signup
Model-Free Least-Squares Policy Iteration.
Michail G. Lagoudakis
Ronald Parr
Published in:
NIPS (2001)
Keyphrases
</>
model free
reinforcement learning
policy iteration
reinforcement learning algorithms
function approximation
temporal difference
reinforcement learning methods
pattern recognition
policy evaluation