Login / Signup

On the Performance and Memory Footprint of Distributed Training: An Empirical Study on Transformers.

Zhengxian LuFangyu WangZhiwei XuFei YangTao Li
Published in: CoRR (2024)
Keyphrases