Login / Signup
An Efficient Algorithm for Fair Multi-Agent Multi-Armed Bandit with Low Regret.
Matthew Jones
Huy L. Nguyen
Thy Dinh Nguyen
Published in:
AAAI (2023)
Keyphrases
</>
multi armed bandit
learning algorithm
multi agent
worst case
reinforcement learning
computational complexity
np hard
probabilistic model
similarity measure
objective function
prediction error
markov chain monte carlo
multi armed bandits