Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration.

Christos Dimitrakakis, Michail G. Lagoudakis
Published in: EWRL (2008)
Keyphrases
  • worst case
  • learning algorithm
  • lower bound
  • upper bound
  • approximate policy iteration