Bounded Regret for Finitely Parameterized Multi-Armed Bandits.
Kishan Panaganti
Dileep M. Kalathil
Published in:
CoRR (2020)
Keyphrases
multi armed bandits
bandit problems
multi armed bandit
decision problems
inductive inference
reinforcement learning
finite number
multi armed bandit problems
active learning
optimal strategy