Keyphrases
- optimal policy
- Markov decision processes
- finite horizon
- state space
- reinforcement learning
- decision problems
- dynamic programming
- decision trees
- long run
- infinite horizon
- average reward
- policy iteration
- finite state
- Bayesian reinforcement learning
- state dependent
- multistage
- sufficient conditions
- Markov decision process
- decision making
- average cost
- learning algorithm
- control policies
- Markov decision problems
- search space
- periodic review