Login / Signup

Lessons from the Trenches on Reproducible Evaluation of Language Models.

Stella BidermanHailey SchoelkopfLintang SutawikaLeo GaoJonathan TowBaber AbbasiAlham Fikri AjiPawan Sasanka AmmanamanchiSidney BlackJordan CliveAnthony DiPofiJulen EtxanizBenjamin FattoriJessica Zosa FordeCharles FosterJeffrey HsuMimansa JaiswalWilson Y. LeeHaonan LiCharles LoveringNiklas MuennighoffEllie PavlickJason PhangAviya SkowronSamson TanXiangru TangKevin A. WangGenta Indra WinataFrançois YvonAndy Zou
Published in: CoRR (2024)
Keyphrases