Login / Signup

AI Alignment through Reinforcement Learning from Human Feedback? Contradictions and Limitations.

Adam Dahlgren LindströmLeila MethnaniLea KrausePetter EricsonÍñigo Martinez de Rituerto de TroyaDimitri Coelho MolloRoel Dobbe
Published in: CoRR (2024)
Keyphrases