Login / Signup
Multigrid Methods for Policy Evaluation and Reinforcement Learning.
O. Ziv
Nahum Shimkin
Published in:
ISIC (2005)
Keyphrases
</>
reinforcement learning
model free
least squares
function approximation
temporal difference
support vector
dynamic programming
machine learning algorithms
markov decision processes
policy evaluation