Login / Signup
To what extent do human explanations of model behavior align with actual model behavior?
Grusha Prasad
Yixin Nie
Mohit Bansal
Robin Jia
Douwe Kiela
Adina Williams
Published in:
BlackboxNLP@EMNLP (2021)
Keyphrases
</>
theoretical framework
mathematical model
statistical model
experimental data
prior knowledge
probability distribution
behavioral model
machine learning
face recognition
hidden markov models
theoretical analysis
formal model
human subjects
network model