Thompson Sampling for Multi-Objective Multi-Armed Bandits Problem.
Saba Q. YahyaaBernard ManderickPublished in: ESANN (2015)
Keyphrases
- multi objective
- multi armed bandits
- multi armed bandit
- evolutionary algorithm
- multi objective optimization
- optimization algorithm
- bandit problems
- multiple objectives
- genetic algorithm
- objective function
- nsga ii
- reinforcement learning
- multi objective optimization problems
- monte carlo
- pareto optimal
- markov chain monte carlo
- sample size
- machine learning
- multi class
- bayesian networks