Emergent Modularity in Pre-trained Transformers.
Zhengyan Zhang
Zhiyuan Zeng
Yankai Lin
Chaojun Xiao
Xiaozhi Wang
Xu Han
Zhiyuan Liu
Ruobing Xie
Maosong Sun
Jie Zhou
Published in:
CoRR (2023)
Keyphrases
pre-trained
training data
training examples
control signals
decision trees
wide range
appearance variations