Planning in Observable POMDPs in Quasipolynomial Time.
Noah GolowichAnkur MoitraDhruv RohatgiPublished in: CoRR (2022)
Keyphrases
- partially observable markov decision processes
- stochastic domains
- partially observable
- belief space
- belief state
- reinforcement learning
- planning under uncertainty
- partially observable markov
- heuristic search
- markov decision processes
- planning problems
- dynamic programming
- predictive state representations
- domain independent
- state space
- blocks world
- action selection
- ai planning
- finite state
- machine learning
- neural network
- decision theoretic
- classical planning
- wide class
- motion planning
- dynamical systems
- partial observability
- optimal policy
- cut elimination
- multi agent
- partially observable stochastic games