A Regret bound for Non-stationary Multi-Armed Bandits with Fairness Constraints.

Shaarad A. R, Ambedkar Dukkipati

Published in: CoRR (2020)

Keyphrases

non-stationary
multi-armed bandits
multi-armed bandit
online learning
regret bounds
empirical mode decomposition
upper bound
bandit problems
support vector
lower bound
outlier detection
linear regression