The Fair Contextual Multi-Armed Bandit.
Yifang Chen
Alex Cuellar
Haipeng Luo
Jignesh Modi
Heramb Nemlekar
Stefanos Nikolaidis
Published in:
AAMAS (2020)
Keyphrases
multi armed bandit
multi armed bandits
reinforcement learning
decentralized decision making
learning algorithm
feature selection
objective function
statistical model
regret bounds
bandit problems