Login / Signup

Towards Maximizing Nonlinear Delay Sensitive Rewards in Queuing Systems.

Sushmitha Shree SAvijit MandalAvhishek ChatterjeeKrishna P. Jagannathan
Published in: CoRR (2022)
Keyphrases
  • queuing systems
  • reinforcement learning
  • markov decision processes
  • single server
  • bandit problems
  • linear programming
  • steady state
  • multiarmed bandit