Sign in

SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning.

Hanrui WangZhekai ZhangSong Han
Published in: HPCA (2021)
Keyphrases