Parallelizing Linear Transformers with the Delta Rule over Sequence Length.
Songlin Yang
Bailin Wang
Yu Zhang
Yikang Shen
Yoon Kim
Published in:
CoRR (2024)
Keyphrases
fixed length
association rules
shift register
neural network
expert systems
linear constraints
rule learning
data sets
least squares
linear model
linear systems
rule discovery
simple linear
biological sequences
longest common subsequence