Computing Solutions in Infinite-Horizon Discounted Adversarial Patrolling Games.
Yevgeniy Vorobeychik, Bo An, Milind Tambe, Satinder P. Singh
Published in: ICAPS (2014)
Keyphrases
- infinite horizon
- finite horizon
- optimal policy
- long run
- optimal control
- dynamic programming
- multi agent
- stochastic demand
- markov decision processes
- single item
- production planning
- state space
- leader follower
- markov decision process
- partially observable
- average cost
- lead time
- dec pomdps
- decision making
- fixed cost
- optimal solution
- policy iteration
- game theory
- holding cost
- periodic review
- inventory policy
- single product
- reinforcement learning