Planification robuste avec (L)RTDP [Robust Planning with (L)RTDP].

Olivier Buffet, Douglas Aberdeen

Published in: CAP (2005)

Keyphrases

heuristic search
markov decision processes
dynamic programming
larger problems
upper bound
control theory
real time dynamic programming
initial state
reinforcement learning methods
state space
beam search
search space
reinforcement learning
optimal policy
dynamical systems
search algorithm
average cost
lower bound
reinforcement learning algorithms
heuristic function
machine learning
planning problems
path finding
control problems
branch and bound
policy iteration
action space