Login / Signup
Not All Metrics Are Guilty: Improving NLG Evaluation by Diversifying References.
Tianyi Tang
Hongyuan Lu
Yuchen Jiang
Haoyang Huang
Dongdong Zhang
Wayne Xin Zhao
Tom Kocmi
Furu Wei
Published in:
NAACL-HLT (2024)
Keyphrases
</>
evaluation metrics
evaluation methods
evaluation criteria
natural language generation
evaluation model
databases
evaluation measures
evaluation method
comparative evaluation
evaluation process
data sets
data mining
information systems
similarity measure
natural language
precision and recall