PAC-Bayes Bounds for Bandit Problems: A Survey and Experimental Comparison.
Hamish FlynnDavid ReebMelih KandemirJan PetersPublished in: CoRR (2022)
Keyphrases
- pac bayes
- experimental comparison
- bandit problems
- risk bounds
- multi armed bandits
- generalization bounds
- decision problems
- linear classifiers
- data dependent
- empirical risk minimization
- feature selection
- generalization ability
- multi armed bandit problems
- vc dimension
- statistical learning theory
- learning theory
- hyperplane
- model selection
- state space
- support vector
- learning algorithm