Login / Signup

How Many Layers and Why? An Analysis of the Model Depth in Transformers.

Antoine SimoulinBenoît Crabbé
Published in: ACL (student) (2021)
Keyphrases
  • mathematical model
  • formal model
  • bayesian networks
  • cost function
  • management system
  • computational model
  • empirical data
  • neural network
  • case study
  • multi agent
  • probabilistic model