Login / Signup
A tractable online learning algorithm for the multinomial logit contextual bandit.
Priyank Agrawal
Theja Tulabandhula
Vashist Avadhanula
Published in:
Eur. J. Oper. Res. (2023)
Keyphrases
</>
learning algorithm
contextual bandit
multinomial logit
online learning
upper confidence bound
training data
active learning
feature selection
batch mode
reinforcement learning
supervised learning
machine learning
computational complexity
learning process
search engine
semi supervised learning
online algorithms