LLMs as Narcissistic Evaluators: When Ego Inflates Evaluation Scores.

Yiqi Liu, Nafise Sadat Moosavi, Chenghua Lin
Published in: CoRR (2023)
Keyphrases
  • evaluation criteria
  • evaluation methods
  • databases
  • high level
  • evaluation model
  • real world
  • machine learning
  • computer vision
  • clustering algorithm
  • three dimensional
  • multi agent
  • evaluation metrics
  • gold standard