Collaborative Multi-agent Stochastic Linear Bandits.
Ahmadreza Moradipari, Mohammad Ghavamzadeh, Mahnoosh Alizadeh
Published in: ACC (2022)
Keyphrases
- multi agent
- stochastic systems
- regret bounds
- multi agent systems
- cooperative
- monte carlo
- team formation
- simple linear
- agent oriented
- case study
- multi armed bandits
- geographically dispersed
- linear systems
- intelligent agents
- online learning
- collaborative learning
- least squares
- neural network
- multiple agents
- linear model
- collaborative environment
- cooperative agents
- lower bound