Shfl-BW: Accelerating Deep Neural Network Inference with Tensor-Core Aware Weight Pruning.
Guyue Huang, Haoran Li, Minghai Qin, Fei Sun, Yufei Ding, Yuan Xie
Published in: CoRR (2022)
Keyphrases
- neural network
- higher order
- weight update
- artificial neural networks
- back propagation
- inference process
- high order
- pattern recognition
- feed forward
- search space
- fuzzy logic
- pruning method
- probabilistic inference
- bayesian inference
- learning vector quantization
- genetic algorithm
- neural network is trained
- belief nets
- network architecture
- multi layer
- bp neural network
- dimensionality reduction
- activation function
- weighting scheme
- feed forward neural networks
- deep learning
- network model
- pruning algorithm
- fuzzy artmap
- pruning algorithms
- neural network model
- bayesian networks