Login / Signup

On Regret-optimal Cooperative Nonstochastic Multi-armed Bandits.

Jialin YiMilan Vojnovic
Published in: CoRR (2022)
Keyphrases
  • multi armed bandits
  • cooperative
  • multi armed bandit
  • bandit problems
  • worst case
  • online learning
  • reinforcement learning
  • dynamic programming
  • game theory
  • decision making
  • special case