Fairness and Welfare Quantification for Regret in Multi-Armed Bandits.
Siddharth Barman
Arindam Khan
Arnab Maiti
Ayush Sawarni
Published in:
CoRR (2022)
Keyphrases
multi armed bandits
bandit problems
multi armed bandit
decision problems
game theory
resource allocation
multi armed bandit problems
regret bounds
reinforcement learning
social welfare
lower bound
np hard