Login / Signup
Achieving the Pareto Frontier of Regret Minimization and Best Arm Identification in Multi-Armed Bandits.
Zixin Zhong
Wang Chi Cheung
Vincent Y. F. Tan
Published in:
Trans. Mach. Learn. Res. (2023)
Keyphrases
</>
multi armed bandits
regret minimization
bandit problems
multi armed bandit
artificial intelligence
e learning
bi objective