One Practical Algorithm for Both Stochastic and Adversarial Bandits.
Yevgeny SeldinAleksandrs SlivkinsPublished in: ICML (2014)
Keyphrases
- monte carlo
- experimental evaluation
- detection algorithm
- cost function
- optimization algorithm
- dynamic programming
- improved algorithm
- matching algorithm
- worst case
- selection algorithm
- preprocessing
- clustering method
- np hard
- learning algorithm
- estimation algorithm
- computationally efficient
- high accuracy
- hidden markov models
- k means
- artificial neural networks
- multi agent
- computational complexity
- classification algorithm
- search algorithm
- path planning
- convex hull
- memory requirements
- recognition algorithm
- optimal solution
- data sets
- maximum flow