Login / Signup

Seq1F1B: Efficient Sequence-Level Pipeline Parallelism for Large Language Model Training.

Ao SunWeilin ZhaoXu HanCheng YangZhiyuan LiuChuan ShiMaosong Sun
Published in: CoRR (2024)
Keyphrases