Benchmarks and Algorithms for Offline Preference-Based Reward Learning.

Daniel Shin Anca D. Dragan Daniel S. Brown

Published in: Trans. Mach. Learn. Res. (2023)

Keyphrases

learning algorithm
reinforcement learning
noise tolerant
learning process
significant improvement
online learning
optimization problems
active learning
computational complexity
real time
learning systems
learning tasks
learning models
data structure
learning problems
computational cost
clustering algorithm
machine learning
machine learning algorithms
orders of magnitude
data sets