Auto-tuning Dense Matrix Multiplication for GPGPU with Cache.
Xiang Cui
Yifeng Chen
Changyou Zhang
Hong Mei
Published in:
ICPADS (2010)
Keyphrases
matrix multiplication
message passing
database workloads
distributed memory
query processing
matrix factorization
prefetching
main memory
energy aware
probabilistic model
computer vision
computational complexity
stereo matching