Login / Signup
UCB-based Algorithms for Multinomial Logistic Regression Bandits.
Sanae Amani
Christos Thrampoulidis
Published in:
NeurIPS (2021)
Keyphrases
</>
learning algorithm
worst case
multi armed bandit
computational complexity
bandit problems
data mining
similarity measure
reinforcement learning
lower bound
artificial neural networks
theoretical analysis