Login / Signup
Approximate Policy Iteration with Bellman Residuals Minimization.
Gennaro Esposito
Mario Martín
Published in:
CCIA (2014)
Keyphrases
</>
least squares
approximate policy iteration
policy iteration
linear program
policy search
learning algorithm
reinforcement learning
objective function
state space
neural network