On Regret-optimal Cooperative Nonstochastic Multi-armed Bandits.

Jialin Yi, Milan Vojnovic

Published in: CoRR (2022)

Keyphrases

multi-armed bandits
cooperative
multi-armed bandit
bandit problems
worst case
online learning
reinforcement learning
dynamic programming
game theory
decision making
special case