Subformer: Exploring Weight Sharing for Parameter Efficiency in Generative Transformers.
Machel Reid
Edison Marrese-Taylor
Yutaka Matsuo
Published in:
CoRR (2021)
Keyphrases
data mining
artificial neural networks
generative model
information systems
multi agent
knowledge sharing
parameter space
parameter settings
data sharing
input parameters
edge weights