The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit.
Lorenzo Noci
Chuning Li
Mufan Bill Li
Bobby He
Thomas Hofmann
Chris Maddison
Daniel M. Roy
Published in:
CoRR (2023)
Keyphrases
statistical models
database
fuzzy logic
real time
real world
experimental data
complex systems
regression model
prior knowledge
depth information
machine learning algorithms
evolutionary algorithm
website
decision making
artificial intelligence
data mining
neural network
data sets