Feel-Good Thompson Sampling for Contextual Dueling Bandits.
Xuheng Li, Heyang Zhao, Quanquan Gu
Published in: CoRR (2024)
Keyphrases
- random sampling
- Monte Carlo
- sampling strategies
- multi-armed bandit
- contextual information
- sampling strategy
- parameter space
- sampling methods
- sampling algorithm
- context sensitive
- sample size
- artificial intelligence
- databases
- Markov chain Monte Carlo
- support vector machine
- information systems
- sampled data
- data mining
- neural network