Thompson Sampling for Unimodal Bandits.

Long Yang Zhao Li Zehong Hu Shasha Ruan Shijian Li Gang Pan Hongyang Chen

Published in: CoRR (2021)

Keyphrases

sampling strategies
multi armed bandit
data sets
monte carlo
random sampling
parameter space
sampling methods
stochastic systems
real world
learning algorithm
search engine
website
learning environment
video sequences
sample size
sampling strategy