Adaptive runtime tuning of parallel sparse matrix-vector multiplication on distributed memory systems.
Seyong Lee, Rudolf Eigenmann
Published in: ICS (2008)
Keyphrases
- distributed memory
- sparse matrix
- shared memory
- IBM SP
- parallel implementation
- matrix multiplication
- floating point
- multiprocessor systems
- data parallelism
- parallel computers
- message passing
- computer systems
- parallel machines
- data sets
- massively parallel
- computer architecture
- distributed systems
- probabilistic model
- dynamic programming
- feature space