VS-Quant: Per-vector Scaled Quantization for Accurate Low-Precision Neural Network Inference.

Published in: MLSys (2021)

Keyphrases