Forced Exploration in Bandit Problems.
Han Qi
Fei Guo
Li Zhu
Published in:
CoRR (2023)
Keyphrases
bandit problems
exploration exploitation
decision problems
multi-armed bandits
data sets
active learning
multi-armed bandit problems
reinforcement learning
special case
upper bound
graphical models
decentralized decision making