Login / Signup

Mirror decent algorithm for a multi-armed bandit governed by a stationary finite state Markov chain.

Alexander V. NazinBoris M. Miller
Published in: ECC (2013)
Keyphrases