What Language Model to Train if You Have One Million GPU Hours?

Teven Le Scao, Thomas Wang, Daniel Hesslow, Lucile Saulnier, Stas Bekman, M. Saiful Bari, Stella Biderman, Hady Elsahar, Niklas Muennighoff, Jason Phang, Ofir Press, Colin Raffel, Victor Sanh, Sheng Shen, Lintang Sutawika, Jaesung Tae, Zheng Xin Yong, Julien Launay, Iz Beltagy
Published in: CoRR (2022)