Optimal Regret Bounds for Collaborative Learning in Bandits.

Amitis Shidani Sattar Vakili

Published in: CoRR (2023)

Keyphrases

regret bounds
collaborative learning
multi armed bandit
online learning
lower bound
linear regression
reinforcement learning
similarity measure
learning process
upper bound
multi class
linear predictors
online convex optimization