Login / Signup
Adam Accumulation to Reduce Memory Footprints of Both Activations and Gradients for Large-Scale DNN Training.
Yijia Zhang
Yibo Han
Shijie Cao
Guohao Dai
Youshan Miao
Ting Cao
Fan Yang
Ningyi Xu
Published in:
ECAI (2023)
Keyphrases
</>
training process
training data
small scale
training set
memory usage
training algorithm
computing power
real life
training examples
test set
labelled data
significantly reduced
supervised learning
real world
neural network
artificial neural networks
learning algorithm
past experience
classifier training
database