Discovering temporally extended features for reinforcement learning in domains with delayed causalities.
Robert Lieck
Marc Toussaint
Published in:
ESANN (2015)
Keyphrases
reinforcement learning
dynamic programming
learning algorithm
state space
Markov decision processes
dynamic systems
decision theoretic planning
temporally extended