Forced Exploration in Bandit Problems.

Han Qi, Fei Guo, Li Zhu

Published in: CoRR (2023)

Keyphrases

bandit problems
exploration-exploitation
decision problems
multi-armed bandits
data sets
active learning
multi-armed bandit problems
reinforcement learning
special case
upper bound
graphical models
decentralized decision making