Partially Observable Online Contingent Planning Using Landmark Heuristics.
Shlomi MaliahRonen I. BrafmanErez KarpasGuy ShaniPublished in: ICAPS (2014)
Keyphrases
- partially observable
- state space
- decision problems
- reinforcement learning
- dynamical systems
- markov decision processes
- classical planning
- partial observability
- markov decision problems
- heuristic search
- belief space
- belief state
- planning domains
- infinite horizon
- partial observations
- partially observable environments
- action models
- heuristic function
- search algorithm
- planning problems
- partially observable markov decision process
- plan existence
- initially unknown
- partially observable domains
- dynamic programming
- optimal planning
- reward function
- ai planning
- bayesian networks