Login / Signup

Multi-armed bandit with sub-exponential rewards.

Huiwen JiaCong ShiSiqian Shen
Published in: Oper. Res. Lett. (2021)
Keyphrases
  • multi armed bandits
  • multi armed bandit
  • reinforcement learning
  • bandit problems
  • decentralized decision making
  • lower bound
  • learning algorithm