An Asymptotically Optimal Algorithm for the Max k-Armed Bandit Problem.
Matthew J. Streeter
Stephen F. Smith
Published in:
AAAI (2006)
Keyphrases
learning algorithm
asymptotically optimal
dynamic programming
machine learning
decision making
optimal solution
search space
real time
objective function
computational complexity
upper bound
worst case
electronic commerce