Sign in
A Finite-Time Analysis of Multi-armed Bandits Problems with Kullback-Leibler Divergences.
Odalric-Ambrym Maillard
Rémi Munos
Gilles Stoltz
Published in:
COLT (2011)
Keyphrases
</>
kullback leibler
cross entropy
kl divergence
machine learning
distance measure
multi armed bandits
gaussian mixture model
kullback leibler divergence
bandit problems