Sign in

CritiqueLLM: Scaling LLM-as-Critic for Effective and Explainable Evaluation of Large Language Model Generation.

Pei KeBosi WenZhuoer FengXiao LiuXuanyu LeiJiale ChengShengyuan WangAohan ZengYuxiao DongHongning WangJie TangMinlie Huang
Published in: CoRR (2023)
Keyphrases
  • high quality
  • natural language
  • evaluation method
  • databases
  • evolutionary algorithm
  • dynamic programming
  • information extraction
  • programming language