Subformer: Exploring Weight Sharing for Parameter Efficiency in Generative Transformers.
Machel Reid
Edison Marrese-Taylor
Yutaka Matsuo
Published in:
EMNLP (Findings) (2021)
Keyphrases
computational complexity
data driven
generative model
unsupervised learning
case study
feature space
parameter space