Thompson Sampling for Unimodal Bandits.
Long Yang
Zhao Li
Zehong Hu
Shasha Ruan
Shijian Li
Gang Pan
Hongyang Chen
Published in:
CoRR (2021)
Keyphrases
sampling strategies
multi armed bandit
data sets
monte carlo
random sampling
parameter space
sampling methods
stochastic systems
real world
learning algorithm
search engine
website
learning environment
video sequences
sample size
sampling strategy