Login / Signup
Further Optimal Regret Bounds for Thompson Sampling
Shipra Agrawal
Navin Goyal
Published in:
CoRR (2012)
Keyphrases
</>
regret bounds
multi armed bandit
optimal solution
monte carlo
sample size
linear regression
lower bound
worst case
particle filter
closed form
sampling algorithm