Login / Signup
Categorized Bandits.
Matthieu Jedor
Jonathan Louëdec
Vianney Perchet
Published in:
CIRCLE (2020)
Keyphrases
</>
stochastic systems
multi armed bandits
reinforcement learning
semi markov
prior knowledge
multi class
regret bounds