Login / Signup
On the Blind Spots of Model-Based Evaluation Metrics for Text Generation.
Tianxing He
Jingyu Zhang
Tianle Wang
Sachin Kumar
Kyunghyun Cho
James R. Glass
Yulia Tsvetkov
Published in:
ACL (1) (2023)
Keyphrases
</>
evaluation metrics
text generation
natural language generation
precision and recall
average precision
evaluation framework
evaluation methodology
learning to rank
evaluation methods
natural language
evaluation measures
dialogue system
artificial intelligence