GENIE: Toward Reproducible and Standardized Human Evaluation for Text Generation.
Daniel Khashabi
Gabriel Stanovsky
Jonathan Bragg
Nicholas Lourie
Jungo Kasai
Yejin Choi
Noah A. Smith
Daniel S. Weld
Published in:
EMNLP (2022)
Keyphrases
text generation
natural language generation
evaluation method
evaluation metrics
gold standard
real time
evaluation criteria
data sets
artificial intelligence
natural language
domain knowledge
comparative evaluation