PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization.

Yidong Wang, Zhuohao Yu, Zhengran Zeng, Linyi Yang, Cunxiang Wang, Hao Chen, Chaoya Jiang, Rui Xie, Jindong Wang, Xing Xie, Wei Ye, Shikun Zhang, Yue Zhang
Published in: CoRR (2023)
Keyphrases
  • automatic evaluation
  • quality assessment
  • human judgments
  • search engine
  • prior knowledge