FlexMoE: Scaling Large-scale Sparse Pre-trained Model Training via Dynamic Device Placement.
Xiaonan Nie
Xupeng Miao
Zilong Wang
Zichao Yang
Jilong Xue
Lingxiao Ma
Gang Cao
Bin Cui
Published in:
CoRR (2023)
Keyphrases
probabilistic model
statistical model
wide range
text classification
real time
pairwise
feature vectors
human body