Login / Signup
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness.
Tri Dao
Daniel Y. Fu
Stefano Ermon
Atri Rudra
Christopher Ré
Published in:
NeurIPS (2022)
Keyphrases
</>
memory efficient
external memory
iterative deepening
highly efficient
visual attention
website
focus of attention
multiple sequence alignment
computer vision
knowledge base
data structure
search algorithm
xml documents
exact solution
integral image