Login / Signup

HumanRankEval: Automatic Evaluation of LMs as Conversational Assistants.

Milan GrittaGerasimos LampourasIgnacio Iacobacci
Published in: NAACL-HLT (2024)
Keyphrases
  • automatic evaluation
  • e learning
  • human judgments
  • quality assessment
  • high quality
  • learning process
  • natural language
  • domain knowledge
  • input image
  • co occurrence