Login / Signup

Fairness and Welfare Quantification for Regret in Multi-Armed Bandits.

Siddharth BarmanArindam KhanArnab MaitiAyush Sawarni
Published in: CoRR (2022)
Keyphrases
  • multi armed bandits
  • bandit problems
  • multi armed bandit
  • decision problems
  • game theory
  • resource allocation
  • multi armed bandit problems
  • regret bounds
  • reinforcement learning
  • social welfare
  • lower bound
  • np hard