Of Human Criteria and Automatic Metrics: A Benchmark of the Evaluation of Story Generation.
Cyril Chhun, Pierre Colombo, Fabian M. Suchanek, Chloé Clavel
Published in: COLING (2022)
Keyphrases
- evaluation criteria
- evaluation methods
- story generation
- automatic evaluation
- human judgments
- evaluation metrics
- evaluation method
- evaluation methodology
- fully automatic
- real world
- evaluation measures
- evaluation model
- gold standard
- human-human interaction
- AHP method
- selection criteria
- evaluation process
- comparative analysis
- human experts
- data driven
- Bayesian networks
- artificial intelligence
- data mining