Login / Signup

Observe Before Play: Multi-armed Bandit with Pre-Observations.

Jinhang ZuoXiaoxi ZhangCarlee Joe-Wong
Published in: SIGMETRICS Perform. Evaluation Rev. (2018)
Keyphrases
  • multi armed bandit
  • multi armed bandits
  • reinforcement learning
  • image sequences
  • special case
  • graphical models