Adversarially Robust Multi-Armed Bandit Algorithm with Variance-Dependent Regret Bounds.

Shinji Ito, Taira Tsuchiya, Junya Honda
Published in: CoRR (2022)
Keyphrases
  • multi-armed bandit
  • learning algorithm
  • regret bounds
  • closed form
  • probabilistic model
  • worst case
  • optimal solution
  • upper bound
  • prediction error