Login / Signup
PowerNorm: Rethinking Batch Normalization in Transformers.
Sheng Shen
Zhewei Yao
Amir Gholami
Michael W. Mahoney
Kurt Keutzer
Published in:
ICML (2020)
Keyphrases
</>
preprocessing
batch mode
batch processing
normalization method
computer vision
technology enhanced learning
clustering algorithm
neural network
real world
information systems
data structure
search algorithm
data streams
online algorithms
batch size