Faithfulness vs. Plausibility: On the (Un)Reliability of Explanations from Large Language Models.

Chirag AgarwalSree Harsha TanneruHimabindu Lakkaraju
Published in: CoRR (2024)
Keyphrases