A General Theoretical Paradigm to Understand Learning from Human Preferences.
Mohammad Gheshlaghi AzarZhaohan Daniel GuoBilal PiotRémi MunosMark RowlandMichal ValkoDaniele CalandrielloPublished in: AISTATS (2024)
Keyphrases
- learning algorithm
- learning process
- special case
- knowledge acquisition
- decision theoretic
- online learning
- decision making
- learning community
- theoretical analysis
- closely related
- learning mechanism
- neural network
- human experts
- learning problems
- learning tasks
- learning systems
- user preferences
- unsupervised learning
- supervised learning
- prior knowledge
- training set
- decision trees
- genetic algorithm