Theoretical guarantees on the best-of-n alignment policy.

Ahmad Beirami Alekh Agarwal Jonathan Berant Alexander D'Amour Jacob Eisenstein Chirag Nagpal Ananda Theertha Suresh

Published in: CoRR (2024)

Keyphrases