Sign in

BranchNorm: Robustly Scaling Extremely Deep Transformers.

Yijin LiuXianfeng ZengFandong MengJie Zhou
Published in: CoRR (2023)
Keyphrases
  • knowledge base
  • deep learning
  • high quality
  • search algorithm
  • multiresolution
  • query processing
  • multiple layers