Thompson Sampling for Multinomial Logit Contextual Bandits.

Min-hwan Oh, Garud Iyengar

Published in: NeurIPS (2019)

Keyphrases

multinomial logit
contextual information
context sensitive
random sampling
feature selection
Monte Carlo
multi-armed bandits
multi-armed bandit
stochastic systems
context dependent
artificial neural networks
sampling strategy
sampled data
contextual knowledge
sampling algorithm
sampling strategies
information retrieval
multiscale
objective function
probabilistic model
active learning