PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization.
Yidong Wang
Zhuohao Yu
Zhengran Zeng
Linyi Yang
Cunxiang Wang
Hao Chen
Chaoya Jiang
Rui Xie
Jindong Wang
Xing Xie
Wei Ye
Shikun Zhang
Yue Zhang
Published in:
CoRR (2023)
Keyphrases
automatic evaluation
quality assessment
human judgments
search engine
prior knowledge