Login / Signup
An Optimal Dynamic Mechanism for Multi-Armed Bandit Processes
Sham M. Kakade
Ilan Lobel
Hamid Nazerzadeh
Published in:
CoRR (2010)
Keyphrases
</>
multi armed bandit
multi armed bandits
machine learning
reinforcement learning
knn
decision making
optimal solution
lower bound
worst case