Finding Optimal POMDP Controllers Using Quadratically Constrained Linear Programs.
Christopher AmatoDaniel S. BernsteinShlomo ZilbersteinPublished in: AI&M (2006)
Keyphrases
- finding optimal
- linear program
- interior point methods
- reinforcement learning
- linear programming
- semi infinite
- optimal solution
- stochastic programming
- dynamic programming
- column generation
- linear programming problems
- objective function
- primal dual
- mixed integer
- partially observable markov decision processes
- markov decision problems
- state space
- simplex method
- finite state
- extreme points
- belief state
- partially observable
- integer program
- optimal policy
- mixed integer linear program
- market equilibrium
- markov decision process
- interior point
- convex functions
- semidefinite programming
- reward function
- markov chain
- np hard