On Regret-optimal Cooperative Nonstochastic Multi-armed Bandits.

Jialin Yi Milan Vojnovic

Published in: AAMAS (2023)

Keyphrases

multi armed bandits
cooperative
multi armed bandit
bandit problems
worst case
lower bound
optimal solution
dynamic programming
loss function
upper bound
closed form
regret bounds