Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation.
Yixin Liu, Alexander R. Fabbri, Pengfei Liu, Yilun Zhao, Linyong Nan, Ruilin Han, Simeng Han, Shafiq Joty, Chien-Sheng Wu, Caiming Xiong, Dragomir Radev
Published in: ACL (1) (2023)