Online POMDP Planning with Anytime Deterministic Guarantees.
Moran BarenboimVadim IndelmanPublished in: NeurIPS (2023)
Keyphrases
- planning problems
- stochastic domains
- belief space
- partially observable markov decision processes
- fully observable
- partially observable
- state space
- online learning
- partially observable stochastic domains
- travel planning
- partially observable markov decision process
- reinforcement learning
- machine learning
- decision theoretic
- markov decision processes
- planning under uncertainty
- domain independent
- heuristic search
- deterministic domains
- real time
- belief state
- single agent
- hidden markov models
- point based value iteration