Login / Signup

The Base-Rate Effect on LLM Benchmark Performance: Disambiguating Test-Taking Strategies from Benchmark Performance.

Kyle MooreJesse RobertsThao PhamOseremhen EwaleifohDouglas H. Fisher
Published in: CoRR (2024)
Keyphrases
  • search engine
  • reinforcement learning
  • special case
  • statistical tests
  • quantitative evaluation