Login / Signup
Thresholding Bandit with Optimal Aggregate Regret.
Chao Tao
Saúl A. Blanco
Jian Peng
Yuan Zhou
Published in:
NeurIPS (2019)
Keyphrases
</>
regret bounds
worst case
dynamic programming
bandit problems
lower bound
online learning
upper confidence bound
multi armed bandit
minimax regret
optimal solution
image segmentation
edge detection
denoising
markov chain
gray level
optimal control
optimal strategy
upper bound
image processing
database