Login / Signup
Do Models Explain Themselves? Counterfactual Simulatability of Natural Language Explanations.
Yanda Chen
Ruiqi Zhong
Narutatsu Ri
Chen Zhao
He He
Jacob Steinhardt
Zhou Yu
Kathleen R. McKeown
Published in:
CoRR (2023)
Keyphrases
</>
language model
probabilistic model
natural language
graphical models
statistical models
decision making
classification models
complex systems
information extraction
model selection
question answering
prior knowledge
experimental data
evolutionary algorithm
information systems
autoregressive
accurate models