Login / Signup

Self-accelerated Thompson sampling with near-optimal regret upper bound.

Zhenyu ZhuLiusheng HuangHongli Xu
Published in: Neurocomputing (2020)
Keyphrases