Login / Signup

State-Aware Value Function Approximation with Attention Mechanism for Restless Multi-armed Bandits.

Shuang WuJingyu ZhaoGuangjian TianJun Wang
Published in: IJCAI (2021)
Keyphrases
  • multi armed bandits
  • state space
  • attention mechanism
  • machine learning
  • pairwise
  • real time
  • probability distribution
  • upper bound
  • human computer interaction
  • optimal control