Adapting multi-armed bandits policies to contextual bandits scenarios.
David Cortes
Published in:
CoRR (2018)
Keyphrases
multi armed bandits
bandit problems
management policies
contextual information
optimal policy
multi armed bandit
multi armed bandit problems
decision problems
lower bound
special case
sufficient conditions
markov decision problems