Parm: Efficient Training of Large Sparsely-Activated Models with Dedicated Schedules.
Xinglin Pan
Wenxiang Lin
Shaohuai Shi
Xiaowen Chu
Weinong Sun
Bo Li
Published in:
CoRR (2024)
Keyphrases
scheduling problem
training samples
machine learning
website
statistical models
cost effective
prior knowledge
probabilistic model
model selection
structured prediction
data mining
statistical model
complex systems
lightweight
graphical models
support vector machine
training set
data structure
face recognition