Login / Signup
A Convergent Form of Approximate Policy Iteration.
Theodore J. Perkins
Doina Precup
Published in:
NIPS (2002)
Keyphrases
</>
machine learning
np hard
dynamic programming