Login / Signup
Boosting Distributed Training Performance of the Unpadded BERT Model.
Jinle Zeng
Min Li
Zhihua Wu
Jiaqi Liu
Yuang Liu
Dianhai Yu
Yanjun Ma
Published in:
CoRR (2022)
Keyphrases
</>
probabilistic model
learning algorithm
mathematical model
computational model
distributed environment
data sets
test data
statistical model
face detection
parameter estimation
training samples
theoretical analysis
probability distribution
cost function
multi agent systems
cooperative
similarity measure