KL-UCB-Switch: Optimal Regret Bounds for Stochastic Bandits from Both a Distribution-Dependent and a Distribution-Free Viewpoints.

Published in: J. Mach. Learn. Res. (2022)

Keyphrases