The Base-Rate Effect on LLM Benchmark Performance: Disambiguating Test-Taking Strategies from Benchmark Performance.
Kyle Moore
Jesse Roberts
Thao Pham
Oseremhen Ewaleifoh
Douglas H. Fisher
Published in:
CoRR (2024)
Keyphrases
search engine
reinforcement learning
special case
statistical tests
quantitative evaluation