Versatile Dueling Bandits: Best-of-both World Analyses for Learning from Relative Preferences.
Aadirupa SahaPierre GaillardPublished in: ICML (2022)
Keyphrases
- learning process
- learning algorithm
- learning problems
- learning systems
- inductive learning
- genetic algorithm
- learning tasks
- decision trees
- reinforcement learning
- database
- prior knowledge
- objective function
- incremental learning
- knowledge acquisition
- online learning
- decision making
- feature selection
- machine learning
- neural network