Overflow Aware Quantization: Accelerating Neural Network Inference by Low-bit Multiply-Accumulate Operations.
Hongwei Xie, Yafei Song, Ling Cai, Mingyang Li
Published in: IJCAI (2020)
Keyphrases
- neural network
- uniform quantization
- logical operations
- artificial neural networks
- neural network model
- feed forward
- neural network is trained
- bit wise
- adaptive quantization
- feed forward neural networks
- network architecture
- probabilistic inference
- associative memory
- fault diagnosis
- bayesian inference
- floating point
- back propagation
- fuzzy logic
- multi layer perceptron
- inference process
- learning vector quantization
- fuzzy artmap
- gray code
- video coding
- knn