Keyphrases
- reward shaping
- reinforcement learning
- reinforcement learning algorithms
- complex domains
- function approximation
- state space
- machine learning
- model-free
- Markov decision problems
- belief functions
- optimal policy
- belief revision
- partially observable
- learning algorithm
- decision makers
- Markov decision processes
- optimal control
- belief state
- dynamic programming
- learning process