Login / Signup
Contrastive Preference Learning: Learning from Human Feedback without Reinforcement Learning.
Joey Hejna
Rafael Rafailov
Harshit Sikchi
Chelsea Finn
Scott Niekum
W. Bradley Knox
Dorsa Sadigh
Published in:
ICLR (2024)
Keyphrases
</>
reinforcement learning
preference learning
learning process
learning algorithm
supervised learning
ordinal regression
machine learning
state space
learning problems
gaussian processes
recommender systems
learning tasks
pairwise comparison
prior knowledge
input output