LiPO: Listwise Preference Optimization through Learning-to-Rank.

Tianqi Liu, Zhen Qin, Junru Wu, Jiaming Shen, Misha Khalman, Rishabh Joshi, Yao Zhao, Mohammad Saleh, Simon Baumgartner, Jialu Liu, Peter J. Liu, Xuanhui Wang
Published in: CoRR (2024)
Keyphrases