Fairness and Welfare Quantification for Regret in Multi-Armed Bandits.

Siddharth Barman Arindam Khan Arnab Maiti Ayush Sawarni

Published in: CoRR (2022)

Keyphrases

multi armed bandits
bandit problems
multi armed bandit
decision problems
game theory
resource allocation
multi armed bandit problems
regret bounds
reinforcement learning
social welfare
lower bound
np hard