Login / Signup
Optimal Regret Bounds for Collaborative Learning in Bandits.
Amitis Shidani
Sattar Vakili
Published in:
CoRR (2023)
Keyphrases
</>
regret bounds
collaborative learning
multi armed bandit
online learning
lower bound
linear regression
reinforcement learning
similarity measure
learning process
upper bound
multi class
linear predictors
online convex optimization