Login / Signup
On Regret-optimal Cooperative Nonstochastic Multi-armed Bandits.
Jialin Yi
Milan Vojnovic
Published in:
CoRR (2022)
Keyphrases
</>
multi armed bandits
cooperative
multi armed bandit
bandit problems
worst case
online learning
reinforcement learning
dynamic programming
game theory
decision making
special case