Stochastic Multi-Armed Bandits with Control Variates.
Arun Verma
Manjesh Kumar Hanawal
Published in:
NeurIPS (2021)
Keyphrases
multi armed bandits
bandit problems
control system
multi armed bandit
dynamic programming
monte carlo
optimal control
support vector
search space
evolutionary algorithm
special case
maximum likelihood