Login / Signup
Non-Repeatable Experiments and Non-Reproducible Results: The Reproducibility Crisis in Human Evaluation in NLP.
Anya Belz
Craig Thomson
Ehud Reiter
Simon Mille
Published in:
ACL (Findings) (2023)
Keyphrases
</>
natural language processing
information extraction
evaluation methods
databases
artificial intelligence
database
data sets
neural network
information systems
natural language
human subjects
evaluation method