• search
    search
  • reviewers
    reviewers
  • feeds
    feeds
  • assignments
    assignments
  • settings
  • logout

Whittle index based Q-learning for restless bandits with average reward.

Konstantin E. AvrachenkovVivek S. Borkar
Published in: Autom. (2022)
Keyphrases