Keyphrases
- bandit problems
- exploration-exploitation
- decision problems
- multi-armed bandits
- active learning
- artificial intelligence
- machine learning
- probability distribution
- decentralized decision making
- dynamic programming
- optimization problems
- experimental data
- benchmark problems
- expected utility
- multi-armed bandit problems