Make Inference Faster: Efficient GPU Memory Management for Butterfly Sparse Matrix Multiplication.

Published in: CoRR (2024)

Keyphrases