Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration.
Christos Dimitrakakis
Michail G. Lagoudakis
Published in:
EWRL (2008)
Keyphrases
worst case
learning algorithm
lower bound
upper bound
approximate policy iteration