Login / Signup

LATTE: Low-Precision Approximate Attention with Head-wise Trainable Threshold for Efficient Transformer.

Jiing-Ping WangMing-Guang LinAn-Yeu Wu
Published in: CoRR (2024)
Keyphrases
  • pairwise
  • visual attention
  • real world
  • search engine
  • website
  • multi agent
  • fuzzy logic
  • computationally efficient
  • power system