Collaborative Multi-agent Stochastic Linear Bandits.

Ahmadreza Moradipari Mohammad Ghavamzadeh Mahnoosh Alizadeh

Published in: ACC (2022)

Keyphrases

multi agent
stochastic systems
regret bounds
multi agent systems
cooperative
monte carlo
team formation
simple linear
agent oriented
case study
multi armed bandits
geographically dispersed
linear systems
intelligent agents
online learning
collaborative learning
least squares
neural network
multiple agents
linear model
collaborative environment
cooperative agents
lower bound