Sign in

How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions.

Lorenzo PacchiardiAlex J. ChanSören MindermannIlan MoscovitzAlexa Y. PanYarin GalOwain EvansJan Brauner
Published in: CoRR (2023)
Keyphrases