The Fair Contextual Multi-Armed Bandit.

Yifang Chen, Alex Cuellar, Haipeng Luo, Jignesh Modi, Heramb Nemlekar, Stefanos Nikolaidis

Published in: AAMAS (2020)

Keyphrases

multi-armed bandit
multi-armed bandits
reinforcement learning
decentralized decision making
learning algorithm
feature selection
objective function
statistical model
regret bounds
bandit problems