SimPO: Simple Preference Optimization with a Reference-Free Reward.

Yu Meng, Mengzhou Xia, Danqi Chen
Published in: CoRR (2024)
Keyphrases