Of Human Criteria and Automatic Metrics: A Benchmark of the Evaluation of Story Generation.

Cyril Chhun, Pierre Colombo, Fabian M. Suchanek, Chloé Clavel

Published in: COLING (2022)

Keyphrases

evaluation criteria
evaluation methods
story generation
automatic evaluation
human judgments
evaluation metrics
evaluation method
evaluation methodology
fully automatic
real world
evaluation measures
evaluation model
gold standard
human-human interaction
AHP method
selection criteria
evaluation process
comparative analysis
human experts
data driven
Bayesian networks
artificial intelligence
data mining