Improved Optimistic Algorithm For The Multinomial Logit Contextual Bandit.
Priyank AgrawalVashist AvadhanulaTheja TulabandhulaPublished in: CoRR (2020)
Keyphrases
- improved algorithm
- dynamic programming
- optimization algorithm
- cost function
- worst case
- contextual bandit
- k means
- np hard
- computational cost
- information extraction
- upper confidence bound
- detection algorithm
- em algorithm
- expectation maximization
- simulated annealing
- information retrieval
- evolutionary algorithm
- recommender systems
- objective function
- web pages