Parallel Matrix-Matrix Multiplication Based on HPL with a GPU-Accelerated PC Cluster.
Qin WangJunichi OhmuraAxida ShanTakefumi MiyoshiHidetsugu IrieTsutomu YoshinagaPublished in: ICNC (2010)
Keyphrases
- pc cluster
- matrix multiplication
- gpu accelerated
- real time
- distributed memory
- dynamic load balancing
- parallel processing
- parallel algorithm
- data partitioning
- message passing
- shared memory
- finite element
- matrix factorization
- personal computer
- simulation tool
- parallel implementation
- b tree
- missing data
- scheduling problem