Thresholding Bandit with Optimal Aggregate Regret.

Chao Tao Saúl A. Blanco Jian Peng Yuan Zhou

Published in: NeurIPS (2019)

Keyphrases

regret bounds
worst case
dynamic programming
bandit problems
lower bound
online learning
upper confidence bound
multi armed bandit
minimax regret
optimal solution
image segmentation
edge detection
denoising
markov chain
gray level
optimal control
optimal strategy
upper bound
image processing
database