Overflow Aware Quantization: Accelerating Neural Network Inference by Low-bit Multiply-Accumulate Operations.

Hongwei Xie Yafei Song Ling Cai Mingyang Li

Published in: IJCAI (2020)

Keyphrases

neural network
uniform quantization
logical operations
artificial neural networks
neural network model
feed forward
neural network is trained
bit wise
adaptive quantization
feed forward neural networks
network architecture
probabilistic inference
associative memory
fault diagnosis
bayesian inference
floating point
back propagation
fuzzy logic
multi layer perceptron
inference process
learning vector quantization
fuzzy artmap
gray code
video coding
knn