Sign in

FlexMoE: Scaling Large-scale Sparse Pre-trained Model Training via Dynamic Device Placement.

Xiaonan NieXupeng MiaoZilong WangZichao YangJilong XueLingxiao MaGang CaoBin Cui
Published in: CoRR (2023)
Keyphrases
  • probabilistic model
  • statistical model
  • wide range
  • text classification
  • real time
  • pairwise
  • feature vectors
  • human body