Login / Signup
Emergent Modularity in Pre-trained Transformers.
Zhengyan Zhang
Zhiyuan Zeng
Yankai Lin
Chaojun Xiao
Xiaozhi Wang
Xu Han
Zhiyuan Liu
Ruobing Xie
Maosong Sun
Jie Zhou
Published in:
ACL (Findings) (2023)
Keyphrases
</>
pre trained
training data
training examples
control signals
wide range
neural network
decision trees
learning process
d objects
dimensionality reduction