Towards Maximizing Nonlinear Delay-Sensitive Rewards in Queuing Systems.

Sushmitha Shree S, Avijit Mandal, Avhishek Chatterjee, Krishna P. Jagannathan
Published in: WiOpt (2023)
Keyphrases
  • queuing systems
  • reinforcement learning
  • Markov decision processes
  • decision making
  • multi-armed bandit
  • neural network
  • graphical models
  • random variables