Login / Signup
A Regret bound for Non-stationary Multi-Armed Bandits with Fairness Constraints.
Shaarad A. R
Ambedkar Dukkipati
Published in:
CoRR (2020)
Keyphrases
</>
non stationary
multi armed bandits
multi armed bandit
online learning
regret bounds
empirical mode decomposition
upper bound
bandit problems
support vector
lower bound
outlier detection
linear regression