Pre-RMSNorm and Pre-CRMSNorm Transformers: Equivalent and Efficient Pre-LN Transformers.

Zixuan Jiang, Jiaqi Gu, Hanqing Zhu, David Z. Pan
Published in: CoRR (2023)
Keyphrases
  • cost effective
  • real time
  • information retrieval
  • social networks
  • image processing
  • search algorithm
  • natural language
  • expert systems