On the Distribution, Sparsity, and Inference-time Quantization of Attention Values in Transformers.

Tianchu Ji, Shraddhan Jain, Michael Ferdman, Peter A. Milder, H. Andrew Schwartz, Niranjan Balasubramanian
Published in: ACL/IJCNLP (Findings) (2021)
Keyphrases