A General Theoretical Paradigm to Understand Learning from Human Preferences.

Published in: AISTATS (2024)

Keyphrases