Login / Signup
Towards Optimal Algorithms for Multi-Player Bandits without Collision Sensing Information.
Wei Huang
Richard Combes
Cindy Trinh
Published in:
COLT (2022)
Keyphrases
</>
learning algorithm
worst case
cooperative
information sharing
multi agent
dynamic programming