On Regret-optimal Cooperative Nonstochastic Multi-armed Bandits.
Jialin Yi
Milan Vojnovic
Published in:
AAMAS (2023)
Keyphrases
multi-armed bandits
cooperative
multi-armed bandit
bandit problems
worst case
lower bound
optimal solution
dynamic programming
loss function
upper bound
closed form
regret bounds