Login / Signup
An Index-based Deterministic Asymptotically Optimal Algorithm for Constrained Multi-armed Bandit Problems.
Hyeong Soo Chang
Published in:
CoRR (2020)
Keyphrases
</>
asymptotically optimal
learning algorithm
computational complexity
np hard
multi dimensional
objective function
optimal solution
dynamic programming
worst case
response time