Login / Signup
Cooperative Multi-agent Bandits: Distributed Algorithms with Optimal Individual Regret and Constant Communication Costs.
Lin Yang
Xuchuang Wang
Mohammad Hajiesmaili
Lijun Zhang
John C. S. Lui
Don Towsley
Published in:
CoRR (2023)
Keyphrases
</>
communication cost
worst case
reduce communication cost
multi armed bandit
regret bounds
sensor networks
distributed systems
feature selection
optimal solution
communication overhead
cooperative multi agent
machine learning
learning algorithm
dynamic programming
stochastic systems