An Asymptotically Optimal Algorithm for the Max k-Armed Bandit Problem.

Matthew J. Streeter Stephen F. Smith

Published in: AAAI (2006)

Keyphrases

learning algorithm
asymptotically optimal
dynamic programming
machine learning
decision making
optimal solution
search space
real time
objective function
computational complexity
upper bound
worst case
electronic commerce