Representation Policy Iteration
Sridhar Mahadevan
Published in: CoRR (2012)
Keyphrases
policy iteration
markov decision processes
model free
reinforcement learning
least squares
sample path
bayesian networks
mathematical model
fixed point