Critical initialization of wide and deep neural networks through partial Jacobians: general theory and applications to LayerNorm.
Darshil DoshiTianyu HeAndrey GromovPublished in: CoRR (2021)
Keyphrases
- general theory
- neural network
- morphological operators
- pattern recognition
- mathematical theory
- fuzzy logic
- stable models
- artificial neural networks
- back propagation
- feature extraction
- computer vision
- closed form
- genetic algorithm
- machine learning
- k means
- edge detection
- image enhancement
- high order
- belief functions
- databases