FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning.

Tri Dao
Published in: CoRR (2023)