Optimizing Large Model Training through Overlapped Activation Recomputation.

Ping Chen, Wenjie Zhang, Shuibing He, Yingjie Gu, Zhuwei Peng, Kexin Huang, Xuan Zhan, Weijian Chen, Yi Zheng, Zhefeng Wang, Yanlong Yin, Gang Chen
Published in: CoRR (2024)