Sign in

Towards Understanding Sycophancy in Language Models.

Mrinank SharmaMeg TongTomasz KorbakDavid DuvenaudAmanda AskellSamuel R. BowmanNewton ChengEsin DurmusZac Hatfield-DoddsScott R. JohnstonShauna KravecTimothy MaxwellSam McCandlishKamal NdousseOliver RauschNicholas SchieferDa YanMiranda ZhangEthan Perez
Published in: CoRR (2023)
Keyphrases