Policy iteration for customer-average performance optimization of closed queueing systems.
Li Xia, Xi Chen, Xi-Ren Cao
Published in: Autom. (2009)
Keyphrases
- queueing systems
- policy iteration
- queueing networks
- Markov decision processes
- service times
- arrival rate
- model free
- long run
- average cost
- reinforcement learning
- Markov processes
- optimal policy
- fixed point
- steady state
- temporal difference
- least squares
- control problems
- average reward
- heavy traffic
- infinite horizon
- single server
- large deviations
- optimal control
- linear programming
- probability distribution
- Bayesian networks
- machine learning