Login / Signup
Defining and Detecting Vulnerability in Human Evaluation Guidelines: A Preliminary Study Towards Reliable NLG Evaluation.
Jie Ruan
Wenqing Wang
Xiaojun Wan
Published in:
NAACL-HLT (2024)
Keyphrases
</>
feature selection
expert systems
evaluation model
database
real time
databases
image processing
natural language
empirical evaluation
evaluation criteria
gold standard
evaluation process
natural language generation