LightSeq: Accelerated Training for Transformer-based Models on GPUs.
Xiaohui Wang
Ying Xiong
Xian Qian
Yang Wei
Lei Li
Mingxuan Wang
Published in: CoRR (2021)
Keyphrases
statistical models
prior knowledge
general purpose
information systems
training set
probabilistic model
training samples
database
case study
feature space
graphical models
neural network model
efficient implementation
accurate models