Login / Signup

Private Benchmarking to Prevent Contamination and Improve Comparative Evaluation of LLMs.

Nishanth ChandranSunayana SitaramDivya GuptaRahul SharmaKashish MittalManohar Swaminathan
Published in: CoRR (2024)
Keyphrases
  • comparative evaluation
  • neural network
  • information systems
  • scoring methods
  • real time
  • text categorization
  • privacy preserving
  • private data