Benchmarking LLMs via Uncertainty Quantification.

Fanghua YeMingming YangJianhui PangLongyue WangDerek F. WongEmine YilmazShuming ShiZhaopeng Tu
Published in: CoRR (2024)
Keyphrases
  • databases
  • uncertain data
  • inherent uncertainty
  • knowledge base
  • database
  • computer vision
  • multi agent
  • natural language
  • quantitative evaluation
  • belief functions
  • decision theory
  • robust optimization
  • measurement error