Login / Signup

Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization.

Junkang WuYuexiang XieZhengyi YangJiancan WuJiawei ChenJinyang GaoBolin DingXiang WangXiangnan He
Published in: CoRR (2024)
Keyphrases