Optimal Policies for ATM Cell Scheduling and Rejection.
Erol GelenbeVijay SrinivasanSridhar SeshadriNatarajan GautamPublished in: Telecommun. Syst. (2001)
Keyphrases
- optimal policy
- markov decision processes
- decision problems
- finite horizon
- scheduling problem
- priority scheduling
- state space
- multistage
- infinite horizon
- reinforcement learning
- dynamic programming
- long run
- finite state
- state dependent
- average reward reinforcement learning
- average reward
- long run average cost
- average cost
- serial inventory systems
- production planning
- sufficient conditions
- control policies
- dynamic programming algorithms
- resource allocation
- policy iteration
- bayesian reinforcement learning
- markov chain
- markov decision process
- partially observable markov decision processes
- semi markov decision processes
- lost sales
- search algorithm