Login / Signup

LoongTrain: Efficient Training of Long-Sequence LLMs with Head-Context Parallelism.

Diandian GuPeng SunQinghao HuTing HuangXun ChenYingtong XiongGuoteng WangQiaoling ChenShangchun ZhaoJiarui FangYonggang WenTianwei ZhangXin JinXuanzhe Liu
Published in: CoRR (2024)
Keyphrases