Login / Signup

Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5?

C. Daniel FreemanLaura CulpAaron ParisiMaxwell L. BileschiGamaleldin F. ElsayedAlex RizkowskyIsabelle SimpsonAlex AlemiAzade NovaBen AdlamBernd BohnetGaurav MishraHanie SedghiIgor MordatchIzzeddin GurJaehoon LeeJohn D. Co-ReyesJeffrey PenningtonKelvin XuKevin SwerskyKshiteej MahajanLechao XiaoRosanne LiuSimon KornblithNoah ConstantPeter J. LiuRoman NovakYundi QianNoah FiedelJascha Sohl-Dickstein
Published in: CoRR (2023)
Keyphrases