Asymptotically Optimal Thompson Sampling Based Policy for the Uniform Bandits and the Gaussian Bandits.

Published in: CoRR (2023)

Keyphrases