Login / Signup
An Asymptotically Optimal Multi-Armed Bandit Algorithm and Hyperparameter Optimization.
Yimin Huang
Yujun Li
Zhenguo Li
Zhihua Zhang
Published in:
CoRR (2020)
Keyphrases
</>
asymptotically optimal
learning algorithm
dynamic programming
expectation maximization
similarity measure
objective function
optimal solution
data structure
worst case
em algorithm
multi armed bandit