Sign in

GOBO: Quantizing Attention-Based NLP Models for Low Latency and Energy Efficient Inference.

Ali Hadi ZadehIsak EdoOmar Mohamed AwadAndreas Moshovos
Published in: MICRO (2020)
Keyphrases