Login / Signup

Whittle index based Q-learning for restless bandits with average reward.

Konstantin E. AvrachenkovVivek S. Borkar
Published in: Autom. (2022)
Keyphrases