A General Theoretical Paradigm to Understand Learning from Human Preferences.
Mohammad Gheshlaghi Azar, Mark Rowland, Bilal Piot, Daniel Guo, Daniele Calandriello, Michal Valko, Rémi Munos
Published in: CoRR (2023)
Keyphrases
- learning algorithm
- learning systems
- learning process
- closely related
- motor skills
- learning analytics
- data sets
- online learning
- knowledge acquisition
- user preferences
- special case
- learning scheme
- theoretical analysis
- collaborative learning
- background knowledge
- human activities
- learning tasks
- human experts
- student learning
- language acquisition
- training data
- facilitate learning