Investigating Low-Rank Training in Transformer Language Models: Efficiency and Scaling Analysis.

Xiuying Wei, Skander Moalla, Razvan Pascanu, Caglar Gulcehre
Published in: CoRR (2024)