Login / Signup

An Empirical Study of LLM-as-a-Judge for LLM Evaluation: Fine-tuned Judge Models are Task-specific Classifiers.

Hui HuangYingqi QuJing LiuMuyun YangTiejun Zhao
Published in: CoRR (2024)
Keyphrases