Login / Signup
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study.
Shusheng Xu
Wei Fu
Jiaxuan Gao
Wenjie Ye
Weilin Liu
Zhiyu Mei
Guangju Wang
Chao Yu
Yi Wu
Published in:
CoRR (2024)
Keyphrases
</>
data sets
image segmentation
image alignment
mobile devices