Login / Signup

Subformer: Exploring Weight Sharing for Parameter Efficiency in Generative Transformers.

Machel ReidEdison Marrese-TaylorYutaka Matsuo
Published in: EMNLP (Findings) (2021)
Keyphrases
  • computational complexity
  • data driven
  • generative model
  • unsupervised learning
  • case study
  • feature space
  • parameter space