Login / Signup
Not All Metrics Are Guilty: Improving NLG Evaluation with LLM Paraphrasing.
Tianyi Tang
Hongyuan Lu
Yuchen Eleanor Jiang
Haoyang Huang
Dongdong Zhang
Wayne Xin Zhao
Furu Wei
Published in:
CoRR (2023)
Keyphrases
</>
evaluation metrics
evaluation measures
natural language generation
evaluation methods
evaluation criteria
genetic algorithm
evaluation methodology
real time
evaluation method
context specific
artificial intelligence
web pages
database systems
context sensitive
gold standard
content oriented xml retrieval