Login / Signup

metabench - A Sparse Benchmark to Measure General Ability in Large Language Models.

Alex KipnisKonstantinos VoudourisLuca M. Schulze BuschoffEric Schulz
Published in: CoRR (2024)
Keyphrases