A Generalized Acquisition Function for Preference-based Reward Learning.
Evan EllisGaurav R. GhosalStuart J. RussellAnca D. DraganErdem BiyikPublished in: CoRR (2024)
Keyphrases
- learning algorithm
- learning process
- learning problems
- reinforcement learning
- online learning
- learning systems
- supervised learning
- unsupervised learning
- mobile learning
- learning community
- policy gradient
- neural network
- inductive inference
- learning scheme
- learning analytics
- incremental learning
- decision problems
- genetic algorithm