Sequential Multi-hypothesis Testing in Multi-armed Bandit Problems: An Approach for Asymptotic Optimality.
Gayathri R. Prabhu
Srikrishna Bhashyam
Aditya Gopalan
Rajesh Sundaresan
Published in:
CoRR (2020)
Keyphrases
hypothesis testing
asymptotic optimality
asymptotically optimal
likelihood ratio
multi-armed bandit problems
statistical tests
sufficient conditions
hypothesis test
special case
robust statistical
likelihood ratio test
training data
reinforcement learning
flowshop
confidence intervals