Login / Signup
Learning in Restless Bandits under Exogenous Global Markov Process.
Tomer Gafni
Michal Yemini
Kobi Cohen
Published in:
CoRR (2021)
Keyphrases
</>
markov process
learning algorithm
reinforcement learning
online learning
pairwise
prior knowledge
dynamic programming
sufficient conditions
stationary distribution
markov processes