Improving Generalization of Alignment with Human Preferences through Group Invariant Learning.

Published in: ICLR (2024)

Keyphrases