Survey Bandits with Regret Guarantees.
Sanath Kumar Krishnamurthy, Susan Athey
Published in: CoRR (2020)
Keyphrases
- regret bounds
- multi-armed bandit problems
- multi-armed bandits
- multi-armed bandit
- lower bound
- online learning
- loss function
- worst case
- bandit problems
- stochastic systems
- minimax regret
- upper bound
- decision problems
- binary classification
- data collection
- survey data
- database
- Bayesian networks
- decision trees
- feature selection
- machine learning
- data sets
- real time