Login / Signup
Mitigating Planner Overfitting in Model-Based Reinforcement Learning.
Dilip Arumugam
David Abel
Kavosh Asadi
Nakul Gopalan
Christopher Grimm
Jun Ki Lee
Lucas Lehnert
Michael L. Littman
Published in:
CoRR (2018)
Keyphrases
</>
model based reinforcement learning
markov decision processes
reinforcement learning
domain independent
decision trees
heuristic search
initial state
finite state
machine learning
optimal policy
average cost
markov decision problems