Keyphrases
- simplex method
- strongly polynomial
- markov decision processes
- linear programming
- linear program
- stationary policies
- dynamic programming
- policy iteration
- optimal policy
- state space
- transition matrices
- average cost
- finite state
- reinforcement learning
- optimal solution
- primal dual
- decision theoretic planning
- column generation
- integer programming
- interior point methods
- minimum cost flow
- objective function
- partially observable
- average reward
- markov decision problems
- np hard
- markov decision process
- special case