Memory-efficient Training of LLMs with Larger Mini-batches.

Dang Nguyen, Wenhan Yang, Rathul Anand, Yu Yang, Baharan Mirzasoleiman
Published in: CoRR (2024)
Keyphrases