Login / Signup
Pre-RMSNorm and Pre-CRMSNorm Transformers: Equivalent and Efficient Pre-LN Transformers.
Zixuan Jiang
Jiaqi Gu
Hanqing Zhu
David Z. Pan
Published in:
NeurIPS (2023)
Keyphrases
</>
databases
database
wide range
neural network
learning algorithm
computer vision
feature selection
three dimensional
data structure
expert systems
association rules
multiresolution
lightweight