Distributional Preference Alignment of LLMs via Optimal Transport
Igor Melnyk
Youssef Mroueh
Brian Belgodere
Mattia Rigotti
Apoorva Nitsure
Mikhail Yurochkin
Kristjan H. Greenewald
Jirí Navrátil
Jerret Ross
Published in:
CoRR (2024)
Keyphrases
dynamic programming
data sets
optimal design
database
user preferences
real time
learning algorithm
optimal solution
preference elicitation