Thompson Sampling for Multinomial Logit Contextual Bandits.
Min-hwan OhGarud IyengarPublished in: NeurIPS (2019)
Keyphrases
- multinomial logit
- contextual information
- context sensitive
- random sampling
- feature selection
- monte carlo
- multi armed bandits
- multi armed bandit
- stochastic systems
- context dependent
- artificial neural networks
- sampling strategy
- sampled data
- contextual knowledge
- sampling algorithm
- sampling strategies
- information retrieval
- multiscale
- objective function
- probabilistic model
- active learning