MPipeMoE: Memory Efficient MoE for Pre-trained Models with Adaptive Pipeline Parallelism.
Zheng Zhang
Donglin Yang
Yaqi Xia
Liang Ding
Dacheng Tao
Xiaobo Zhou
Dazhao Cheng
Published in:
IPDPS (2023)
Keyphrases
memory efficient
pre-trained
high dimensional
state space