Make Inference Faster: Efficient GPU Memory Management for Butterfly Sparse Matrix Multiplication.
Antoine Gonon
Léon Zheng
Pascal Carrivain
Quoc-Tung Le
Published in:
CoRR (2024)
Keyphrases
memory management
matrix multiplication
highly efficient
computer vision
real time
operating system
matrix factorization
hardware implementation
high dimensional
query processing
multi dimensional
parallel computing