Technical Note - Elimination of Suboptimal Actions in Markov Decision Problems.
Richard C. GrinoldPublished in: Oper. Res. (1973)
Keyphrases
- markov decision problems
- partially observable
- decision theoretic
- decision processes
- state space
- reinforcement learning
- action space
- dynamical systems
- reward function
- linear programming
- optimal policy
- markov decision processes
- state transitions
- utility function
- decision problems
- action selection
- dynamic programming
- infinite horizon
- reasoning process
- real valued
- expected utility
- belief state
- transition probabilities
- learning algorithm
- linear program
- steady state
- bayesian networks