Sign in

Emergent Modularity in Pre-trained Transformers.

Zhengyan ZhangZhiyuan ZengYankai LinChaojun XiaoXiaozhi WangXu HanZhiyuan LiuRuobing XieMaosong SunJie Zhou
Published in: ACL (Findings) (2023)
Keyphrases
  • pre trained
  • training data
  • training examples
  • control signals
  • wide range
  • neural network
  • decision trees
  • learning process
  • d objects
  • dimensionality reduction