Learning While Scheduling in Multi-Server Systems With Unknown Statistics: MaxWeight with Discounted UCB.
Zixian YangR. SrikantLei YingPublished in: AISTATS (2023)
Keyphrases
- learning systems
- learning algorithm
- learning process
- complex systems
- knowledge acquisition
- distributed systems
- learning problems
- learning tasks
- online learning
- data sets
- multi agent
- reinforcement learning
- information systems
- artificial intelligence
- dynamic programming
- decision making
- learning community
- intelligent behavior
- bandit problems