Login / Signup

Entangled Preferences: The History and Risks of Reinforcement Learning and Human Feedback.

Nathan LambertThomas Krendl GilbertTom Zick
Published in: CoRR (2023)
Keyphrases