Labeled RTDP: Improving the Convergence of Real-Time Dynamic Programming.
Blai Bonet
Hector Geffner
Published in:
ICAPS (2003)
Keyphrases
real-time dynamic programming
Markov decision processes
state space
reinforcement learning
finite state
dynamic programming
supervised learning
search algorithm
upper bound
Markov chain
convergence rate