How Many Layers and Why? An Analysis of the Model Depth in Transformers.
Antoine Simoulin
Benoît Crabbé
Published in:
ACL (student) (2021)
Keyphrases
mathematical model
formal model
bayesian networks
cost function
management system
computational model
empirical data
neural network
case study
multi agent
probabilistic model