Login / Signup
Learning Partial Policies to Speedup MDP Tree Search via Reduction to I.I.D. Learning.
Jervis Pinto
Alan Fern
Published in:
J. Mach. Learn. Res. (2017)
Keyphrases
</>
reinforcement learning
tree search
cost function
temporal reasoning