Login / Signup
Theoretical guarantees on the best-of-n alignment policy.
Ahmad Beirami
Alekh Agarwal
Jonathan Berant
Alexander D'Amour
Jacob Eisenstein
Chirag Nagpal
Ananda Theertha Suresh
Published in:
CoRR (2024)
Keyphrases
</>
theoretical guarantees
policy iteration
worst case
optimal policy
machine learning
reinforcement learning
markov decision process
bregman divergences
objective function
graphical models
infinite horizon
model free