Optimal Fixed-Budget Best Arm Identification using the Augmented Inverse Probability Weighting Estimator in Two-Armed Gaussian Bandits with Unknown Variances.
Masahiro KatoKaito AriuMasaaki ImaizumiMasatoshi UeharaMasahiro NomuraChao QinPublished in: CoRR (2022)