Shfl-BW: accelerating deep neural network inference with tensor-core aware weight pruning.

Guyue Huang Haoran Li Minghai Qin Fei Sun Yufei Ding Yuan Xie

Published in: DAC (2022)

Keyphrases

neural network
artificial neural networks
neural network model
higher order
high order
bayesian inference
search space
weight update
pruning method
inference process
belief nets
feed forward
neural network is trained
deep learning
feed forward neural networks
fuzzy logic
pattern recognition
genetic algorithm
multi layer
radial basis function
fault diagnosis
bayesian networks
pruning methods
synaptic weights
tensor space
pruning algorithm
probabilistic inference
neural nets
belief networks