Are self-explanations from Large Language Models faithful?

Andreas Madsen, Sarath Chandar, Siva Reddy
Published in: CoRR (2024)