A Generalized Acquisition Function for Preference-based Reward Learning.

Evan Ellis, Gaurav R. Ghosal, Stuart J. Russell, Anca D. Dragan, Erdem Biyik

Published in: CoRR (2024)

Keyphrases

learning algorithm
learning process
learning problems
reinforcement learning
online learning
learning systems
supervised learning
unsupervised learning
mobile learning
learning community
policy gradient
neural network
inductive inference
learning scheme
learning analytics
incremental learning
decision problems
genetic algorithm