GENIE: A Leaderboard for Human-in-the-Loop Evaluation of Text Generation.
Daniel KhashabiGabriel StanovskyJonathan BraggNicholas LourieJungo KasaiYejin ChoiNoah A. SmithDaniel S. WeldPublished in: CoRR (2021)
Keyphrases
- text generation
- natural language generation
- human interaction
- evaluation methods
- database
- evaluation method
- gold standard
- evaluation criteria
- evaluation measures
- human subjects
- natural language
- information systems
- evaluation metrics
- human computer interaction
- information extraction
- ground truth
- probability distribution
- evaluation process