Login / Signup
Learning and Planning with Timing Information in Markov Decision Processes.
Pierre-Luc Bacon
Borja Balle
Doina Precup
Published in:
UAI (2015)
Keyphrases
</>
markov decision processes
reinforcement learning
partially observable
macro actions
state space
stochastic games
optimal policy
learning algorithm
supervised learning
heuristic search
finite state
decision theoretic planning
model based reinforcement learning
data mining
learning tasks