Login / Signup

Robust Modified Policy Iteration.

David L. KaufmanAndrew J. Schaefer
Published in: INFORMS J. Comput. (2013)
Keyphrases
  • policy iteration
  • markov decision processes
  • model free
  • finite state
  • sample path
  • fixed point
  • temporal difference
  • reinforcement learning
  • least squares
  • neural network
  • optimal control
  • markov decision process