Sign in

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback.

Stephen CasperXander DaviesClaudia ShiThomas Krendl GilbertJérémy ScheurerJavier RandoRachel FreedmanTomasz KorbakDavid LindnerPedro FreireTony WangSamuel MarksCharbel-Raphaël SégerieMicah CarrollAndi PengPhillip J. K. ChristoffersenMehul DamaniStewart SlocumUsman AnwarAnand SiththaranjanMax NadeauEric J. MichaudJacob PfauDmitrii KrasheninnikovXin ChenLauro LangoscoPeter HaseErdem BiyikAnca D. DraganDavid KruegerDorsa SadighDylan Hadfield-Menell
Published in: CoRR (2023)
Keyphrases