Ps and Qs: Quantization-aware pruning for efficient low latency neural network inference.

Published in: CoRR (2021)

Keyphrases