Improving Generalization of Alignment with Human Preferences through Group Invariant Learning.

Published in: CoRR (2023)

Keyphrases