Keyphrases
- optimal policy
- Markov decision process
- state space
- Markov decision processes
- reward function
- factored Markov decision processes
- reinforcement learning
- partially observable
- machine learning
- Markov decision problems
- utility function
- dynamic programming
- neural network
- infinite horizon
- standard deviation
- partially observable Markov decision processes
- initial state
- control policies
- computational complexity
- revenue management
- discount factor