Allocating Divisible Resources on Arms with Unknown and Random Rewards.

Wenhao Li, Ningyuan Chen

Published in: COLT (2023)

Keyphrases

resource allocation
multi-armed bandits
reinforcement learning
resource management
Markov decision processes
limited resources
bandit problems
real time
computing resources
expected reward
machine learning
computing environments
long term and short term
multiarmed bandit
partially observed
resource requirements
learning resources
real world