The Impact of Preference Agreement in Reinforcement Learning from Human Feedback: A Case Study in Summarization.

Sian Gooding, Hassan Mansoor
Published in: CoRR (2023)
Keyphrases