Login / Signup
Towards Maximizing Nonlinear Delay-Sensitive Rewards in Queuing Systems.
Sushmitha Shree S
Avijit Mandal
Avhishek Chatterjee
Krishna P. Jagannathan
Published in:
WiOpt (2023)
Keyphrases
</>
queuing systems
reinforcement learning
markov decision processes
decision making
multiarmed bandit
neural network
graphical models
random variables