Exploiting Input Tensor Dynamics in Activation Checkpointing for Efficient Training on GPU.
Jianjin LiaoMingzhen LiHailong YangQingxiao SunBiao SunJiwei HaoTianyu FengFengwei YuShengdong ChenYe TaoZicheng ZhangZhongzhi LuanDepei QianPublished in: IPDPS (2023)