Login / Signup

Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms.

Rafael RafailovYaswanth ChittepuRyan ParkHarshit SikchiJoey HejnaW. Bradley KnoxChelsea FinnScott Niekum
Published in: CoRR (2024)
Keyphrases