Sign in

What happens if you treat ordinal ratings as interval data? Human evaluations in NLP are even more under-powered than you think.

David M. HowcroftVerena Rieser
Published in: EMNLP (1) (2021)
Keyphrases