Observe Before Play: Multi-armed Bandit with Pre-Observations.
Jinhang Zuo
Xiaoxi Zhang
Carlee Joe-Wong
Published in:
SIGMETRICS Perform. Evaluation Rev. (2018)
Keyphrases
multi armed bandit
multi armed bandits
reinforcement learning
image sequences
special case
graphical models