A dynamic programming strategy to balance exploration and exploitation in the bandit problem.

Olivier Caelen, Gianluca Bontempi
Published in: Ann. Math. Artif. Intell. (2010)