Login / Signup

MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences.

Souradip ChakrabortyJiahao QiuHui YuanAlec KoppelFurong HuangDinesh ManochaAmrit Singh BediMengdi Wang
Published in: CoRR (2024)
Keyphrases