Allocating Divisible Resources on Arms with Unknown and Random Rewards.
Wenhao LiNingyuan ChenPublished in: COLT (2023)
Keyphrases
- resource allocation
- multi armed bandits
- reinforcement learning
- resource management
- markov decision processes
- limited resources
- bandit problems
- real time
- computing resources
- expected reward
- machine learning
- computing environments
- long term and short term
- multiarmed bandit
- partially observed
- resource requirements
- learning resources
- real world