Sign in
SixthSense: Fast and Reliable Recognition of Dead Ends in MDPs.
Andrey Kolobov
Mausam
Daniel S. Weld
Published in:
AAAI (2010)
Keyphrases
</>
markov decision processes
reinforcement learning
dead ends
state space
optimal policy
objective function
multi dimensional
monte carlo
state variables