Login / Signup
Linear bandits with polylogarithmic minimax regret.
Josep Lumbreras
Marco Tomamichel
Published in:
COLT (2024)
Keyphrases
</>
minimax regret
preference elicitation
utility function
probability distribution