• search
    search
  • reviewers
    reviewers
  • feeds
    feeds
  • assignments
    assignments
  • settings
  • logout

Theoretical guarantees on the best-of-n alignment policy.

Ahmad BeiramiAlekh AgarwalJonathan BerantAlexander D'AmourJacob EisensteinChirag NagpalAnanda Theertha Suresh
Published in: CoRR (2024)
Keyphrases