Login / Signup

On effectiveness of the Mirror Decent Algorithm for a stochastic multi-armed bandit governed by a stationary finite Markov chain.

Alexander V. NazinBoris M. Miller
Published in: AuCC (2013)
Keyphrases