Keyphrases
- simplex method
- strongly polynomial
- markov decision processes
- linear program
- linear programming
- stationary policies
- dynamic programming
- policy iteration
- primal dual
- optimal policy
- average cost
- finite state
- transition matrices
- state space
- minimum cost flow
- column generation
- reinforcement learning
- interior point methods
- optimal solution
- integer programming
- average reward
- np hard
- partially observable
- objective function
- decision theoretic planning
- infinite horizon
- markov decision process
- reward function
- simulated annealing