Login / Signup
Generalizing Reward Modeling for Out-of-Distribution Preference Learning.
Chen Jia
Published in:
CoRR (2024)
Keyphrases
</>
preference learning
ordinal regression
probability distribution
gaussian processes
information retrieval
multi class
pairwise comparison
machine learning
reinforcement learning
rough sets