Login / Signup
Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement.
Yongji Wu
Wenjie Qu
Tianyang Tao
Zhuang Wang
Wei Bai
Zhuohao Li
Yuan Tian
Jiaheng Zhang
Matthew Lentz
Danyang Zhuo
Published in:
CoRR (2024)
Keyphrases
</>
domain experts
decision trees
statistical models
random fields
gaussian model
data sets
prior knowledge
probability distribution
semi supervised
process model
training algorithm
structured prediction