Multi-Armed Bandits with Bounded Arm-Memory: Near-Optimal Guarantees for Best-Arm Identification and Regret Minimization.
Arnab Maiti
Vishakha Patil
Arindam Khan
Published in:
NeurIPS (2021)
Keyphrases
multi-armed bandits
regret minimization
computational complexity
state space