Login / Signup

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback.

Stephen CasperXander DaviesClaudia ShiThomas Krendl GilbertJérémy ScheurerJavier RandoRachel FreedmanTomasz KorbakDavid LindnerPedro FreireTony Tong WangSamuel MarksCharbel-Raphaël SégerieMicah CarrollAndi PengPhillip J. K. ChristoffersenMehul DamaniStewart SlocumUsman AnwarAnand SiththaranjanMax NadeauEric J. MichaudJacob PfauDmitrii KrasheninnikovXin ChenLauro LangoscoPeter HaseErdem BiyikAnca D. DraganDavid KruegerDorsa SadighDylan Hadfield-Menell
Published in: Trans. Mach. Learn. Res. (2023)
Keyphrases