RewardBench: Evaluating Reward Models for Language Modeling.
Nathan Lambert, Valentina Pyatkin, Jacob Morrison, LJ Miranda, Bill Yuchen Lin, Khyathi Raghavi Chandu, Nouha Dziri, Sachin Kumar, Tom Zick, Yejin Choi, Noah A. Smith, Hannaneh Hajishirzi
Published in: CoRR (2024)