Sign in

Thompson Sampling Regret Bounds for Contextual Bandits with sub-Gaussian rewards.

Amaury GouverneurBorja Rodríguez GálvezTobias J. OechteringMikael Skoglund
Published in: ISIT (2023)
Keyphrases