Accelerating Transformer Pre-Training with 2:4 Sparsity.
Yuezhou Hu
Kang Zhao
Weiyu Huang
Jianfei Chen
Jun Zhu
Published in:
CoRR (2024)
Keyphrases
training algorithm
training set
fuzzy logic
neural network
training process
power system
training samples
database
high dimensional
training examples
high voltage
computer software
test set
expert systems
decision trees
knowledge base
learning algorithm
data sets