Login / Signup
MINI-SEQUENCE TRANSFORMER: Optimizing Intermediate Memory for Long Sequences Training.
Cheng Luo
Jiawei Zhao
Zhuoming Chen
Beidi Chen
Anima Anandkumar
Published in:
CoRR (2024)
Keyphrases
</>
long sequences
training samples
mining sequential patterns
fuzzy logic
supervised learning
data streams
training set
power system
training process
fault diagnosis
computer software
memory space
memory usage
training phase
training algorithm
main memory
real time
training examples
hidden markov models
data structure