Simultaneously Learning Stochastic and Adversarial Bandits under the Position-Based Model.
Cheng ChenCanzhe ZhaoShuai LiPublished in: AAAI (2022)
Keyphrases
- mathematical model
- prior knowledge
- learning process
- probability distribution
- stochastic model
- learning algorithm
- learning models
- learning mechanism
- learning systems
- learning phase
- management system
- probabilistic model
- boltzmann machine
- e learning
- learned models
- connectionist networks
- multi armed bandits
- hidden variables
- unsupervised learning
- learning objects
- objective function
- similarity measure
- decision trees