Login / Signup

Nash Learning from Human Feedback.

Rémi MunosMichal ValkoDaniele CalandrielloMohammad Gheshlaghi AzarMark RowlandZhaohan Daniel GuoYunhao TangMatthieu GeistThomas MesnardAndrea MichiMarco SelviSertan GirginNikola MomchevOlivier BachemDaniel J. MankowitzDoina PrecupBilal Piot
Published in: CoRR (2023)
Keyphrases