Double Thompson Sampling for Dueling Bandits.
Huasen Wu
Xin Liu
R. Srikant
Published in:
CoRR (2016)
Keyphrases
multi-armed bandit
sample size
random sampling
neural network
Monte Carlo
database
artificial intelligence
least squares
sampling algorithm
sampling methods