Login / Signup

Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study.

Shusheng XuWei FuJiaxuan GaoWenjie YeWeilin LiuZhiyu MeiGuangju WangChao YuYi Wu
Published in: CoRR (2024)
Keyphrases
  • data sets
  • image segmentation
  • image alignment
  • mobile devices