Login / Signup
Contrastive Preference Learning: Learning from Human Feedback without RL.
Joey Hejna
Rafael Rafailov
Harshit Sikchi
Chelsea Finn
Scott Niekum
W. Bradley Knox
Dorsa Sadigh
Published in:
CoRR (2023)
Keyphrases
</>
preference learning
reinforcement learning
learning process
learning algorithm
active learning
machine learning
label ranking
learning problems
gaussian processes
pairwise comparison
decision making
supervised learning