Optimus-CC: Efficient Large NLP Model Training with 3D Parallelism Aware Communication Compression.
Jaeyong Song
Jinkyu Yim
Jaewon Jung
Hongsun Jang
Hyung-Jin Kim
Youngsok Kim
Jinho Lee
Published in:
ASPLOS (2) (2023)
Keyphrases
mathematical model
formal model
multiscale
management system
probabilistic model
computational model
theoretical analysis
parallel architectures
data compression
visual information
statistical model
training examples
neural network
em algorithm
prior knowledge
data structure
bayesian networks
information retrieval