Determination of the minimax risk for Bernoulli multi-armed bandit.

Alexander V. Kolnogorov
Published in: ALCOSP (2010)
Keyphrases
  • multi-armed bandit
  • multi-armed bandits
  • reinforcement learning
  • decision making
  • decentralized decision making
  • learning algorithm
  • worst case