Computing Optimal Policies for Controlled Tandem Queueing Systems.
Katsuhisa Ohno, Kuniyoshi Ichiki
Published in: Oper. Res. (1987)
Keyphrases
- queueing systems
- optimal policy
- Markov decision processes
- decision problems
- long run
- queueing networks
- arrival rate
- reinforcement learning
- finite horizon
- dynamic programming
- state space
- Markov processes
- control problems
- finite state
- state dependent
- average reward
- average reward reinforcement learning
- multistage
- heavy traffic
- infinite horizon
- asymptotically optimal
- queue length
- single server
- initial state
- steady state
- sufficient conditions
- large deviations
- data mining
- serial inventory systems
- decision making
- queueing model
- policy iteration
- special case