MPipeMoE: Memory Efficient MoE for Pre-trained Models with Adaptive Pipeline Parallelism.

Zheng Zhang, Donglin Yang, Yaqi Xia, Liang Ding, Dacheng Tao, Xiaobo Zhou, Dazhao Cheng
Published in: IPDPS (2023)
Keyphrases
  • memory efficient
  • pre-trained
  • high dimensional
  • state space