Shfl-BW: accelerating deep neural network inference with tensor-core aware weight pruning.
Guyue HuangHaoran LiMinghai QinFei SunYufei DingYuan XiePublished in: DAC (2022)
Keyphrases
- neural network
- artificial neural networks
- neural network model
- higher order
- high order
- bayesian inference
- search space
- weight update
- pruning method
- inference process
- belief nets
- feed forward
- neural network is trained
- deep learning
- feed forward neural networks
- fuzzy logic
- pattern recognition
- genetic algorithm
- multi layer
- radial basis function
- fault diagnosis
- bayesian networks
- pruning methods
- synaptic weights
- tensor space
- pruning algorithm
- probabilistic inference
- neural nets
- belief networks