Ps and Qs: Quantization-Aware Pruning for Efficient Low Latency Neural Network Inference.

Published in: Frontiers Artif. Intell. (2021)

Keyphrases