
Beyond One-Preference-for-All: Multi-Objective Direct Preference Optimization for Language Models.

Zhanhui Zhou, Jie Liu, Chao Yang, Jing Shao, Yu Liu, Xiangyu Yue, Wanli Ouyang, Yu Qiao
Published in: CoRR (2023)