The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit
Lorenzo Noci
Chuning Li
Mufan Bill Li
Bobby He
Thomas Hofmann
Chris J. Maddison
Dan Roy
Published in:
NeurIPS (2023)
Keyphrases
artificial intelligence
model selection
complex systems
high quality
d objects
statistical models
visual attention
parametric models
databases
data mining
probabilistic model
regression model
classification models
autoregressive