Online Least Squares Estimation with Self-Normalized Processes: An Application to Bandit Problems
Yasin Abbasi-Yadkori
Dávid Pál
Csaba Szepesvári
Published in:
CoRR (2011)
Keyphrases
bandit problems
multi-armed bandits
online learning
decision problems
real time
similarity measure
decision makers