Login / Signup

Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts.

Yueqin YinZhendong WangYi GuHai HuangWeizhu ChenMingyuan Zhou
Published in: CoRR (2024)
Keyphrases