Login / Signup
Towards Maximizing Nonlinear Delay Sensitive Rewards in Queuing Systems.
Sushmitha Shree S
Avijit Mandal
Avhishek Chatterjee
Krishna P. Jagannathan
Published in:
CoRR (2022)
Keyphrases
</>
queuing systems
reinforcement learning
markov decision processes
single server
bandit problems
linear programming
steady state
multiarmed bandit