Login / Signup
Non-Stochastic Multi-Player Multi-Armed Bandits: Optimal Rate With Collision Information, Sublinear Without.
Sébastien Bubeck
Yuanzhi Li
Yuval Peres
Mark Sellke
Published in:
COLT (2020)
Keyphrases
</>
learning algorithm
upper bound