Dynamic Programming for One-Sided Partially Observable Pursuit-Evasion Games.
Karel Horák, Branislav Bosanský
Published in: CoRR (2016)
Keyphrases
- partially observable
- pursuit evasion
- dynamic programming
- state space
- infinite horizon
- Markov decision processes
- Markov decision problems
- reinforcement learning
- dynamical systems
- decision problems
- optimal policy
- partial observability
- belief state
- optimal control
- Dec-POMDPs
- partially observable environments
- partial observations
- game theory
- partially observable domains
- finite state
- action models
- stereo matching
- linear programming
- state variables
- partially observable Markov decision processes
- long run
- reinforcement learning algorithms
- Nash equilibrium
- Markov decision process
- Markov chain
- average cost
- particle filter
- search space
- search algorithm
- machine learning