Keyphrases
- state information
- long-term
- reinforcement learning
- action space
- state space
- action selection
- Markov decision processes
- action models
- optimal policy
- orders of magnitude
- Markov chain
- real-valued
- Markov decision process
- dynamic programming
- learning algorithm
- heuristic search
- transfer learning
- search strategies
- domain-specific
- mobile robot
- decision making