Approximate Policy Iteration with Bellman Residuals Minimization.

Gennaro Esposito, Mario Martín
Published in: CCIA (2014)
Keyphrases
  • least squares
  • approximate policy iteration
  • policy iteration
  • linear program
  • policy search
  • learning algorithm
  • reinforcement learning
  • objective function
  • state space
  • neural network