Login / Signup

A Case Study in CUDA Kernel Fusion: Implementing FlashAttention-2 on NVIDIA Hopper Architecture using the CUTLASS Library.

Ganesh BikshandiJay Shah
Published in: CoRR (2023)
Keyphrases