Preference-based Online Learning with Dueling Bandits: A Survey.
Viktor BengsRóbert Busa-FeketeAdil El Mesaoudi-PaulEyke HüllermeierPublished in: J. Mach. Learn. Res. (2021)
Keyphrases
- online learning
- regret bounds
- e learning
- multi armed bandits
- higher education
- distance education
- blended learning
- computer mediated
- learning management systems
- distance learning
- stochastic systems
- active learning
- online course
- online algorithms
- classroom learning
- online education
- database
- online learning environments
- multi armed bandit problems
- statistically significant
- case based reasoning
- lower bound
- optimal solution
- data mining
- databases