Login / Signup
AMSP: Super-Scaling LLM Training via Advanced Model States Partitioning.
Qiaoling Chen
Qinghao Hu
Zhisheng Ye
Guoteng Wang
Peng Sun
Yonggang Wen
Tianwei Zhang
Published in:
CoRR (2023)
Keyphrases
</>
computational model
training data
probabilistic model
probability distribution
experimental data
formal model
management system
maximum likelihood
mathematical model
simulation model
prediction model
autoregressive