Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration.

Christos Dimitrakakis Michail G. Lagoudakis

Published in: EWRL (2008)

Keyphrases

worst case
learning algorithm
lower bound
upper bound
approximate policy iteration