Login / Signup
JoMA: Demystifying Multilayer Transformers via JOint Dynamics of MLP and Attention.
Yuandong Tian
Yiping Wang
Zhenyu Zhang
Beidi Chen
Simon S. Du
Published in:
CoRR (2023)
Keyphrases
</>
multilayer perceptron
dynamical systems
neural network
dynamic model
real time
databases
artificial intelligence
data sets
knowledge base
artificial neural networks
visual attention
multi layer perceptron
joint estimation
neural classifier
layer perceptron