Keyphrases
- markov decision processes
- state space
- heuristic search
- markov decision chains
- optimal policy
- stochastic shortest path
- markov decision process
- partially observable markov
- dynamic programming
- infinite horizon
- policy iteration
- artificial neural networks
- belief space
- partially observable markov decision processes
- average reward
- blind source separation
- learning algorithm