
State-Aware Value Function Approximation with Attention Mechanism for Restless Multi-armed Bandits.

Shuang Wu, Jingyu Zhao, Guangjian Tian, Jun Wang
Published in: IJCAI (2021)
Keyphrases
  • multi armed bandits
  • state space
  • attention mechanism
  • machine learning
  • pairwise
  • real time
  • probability distribution
  • upper bound
  • human computer interaction
  • optimal control