Login / Signup
DeepNet: Scaling Transformers to 1, 000 Layers.
Hongyu Wang
Shuming Ma
Li Dong
Shaohan Huang
Dongdong Zhang
Furu Wei
Published in:
CoRR (2022)
Keyphrases
</>
data mining
machine learning
computer vision
decision trees
case study
color images
multi layer
single layer
multiple layers