Login / Signup

Implementing and Optimizing the Scaled Dot-Product Attention on Streaming Dataflow.

Gina SohnNathan ZhangKunle Olukotun
Published in: CoRR (2024)
Keyphrases
  • dot product
  • feature space
  • kernel function
  • gaussian kernels
  • similarity function
  • scalar product