Preference-based Online Learning with Dueling Bandits: A Survey.

Viktor Bengs Róbert Busa-Fekete Adil El Mesaoudi-Paul Eyke Hüllermeier

Published in: J. Mach. Learn. Res. (2021)

Keyphrases

online learning
regret bounds
e learning
multi armed bandits
higher education
distance education
blended learning
computer mediated
learning management systems
distance learning
stochastic systems
active learning
online course
online algorithms
classroom learning
online education
database
online learning environments
multi armed bandit problems
statistically significant
case based reasoning
lower bound
optimal solution
data mining
databases