Login / Signup
Learning structured reactive navigation plans from executing MDP navigation policies.
Michael Beetz
Thorsten Belker
Published in:
Agents (2001)
Keyphrases
</>
learning process
reinforcement learning
prior knowledge
state space
optimal policy
markov decision processes
machine learning
online learning
markov decision process
learning algorithm
learning systems
information space
partially observable