Performance Characterization of a Hierarchical MPI Implementation on Large-scale Distributed-memory Platforms.
Sadaf R. AlamRichard F. BarrettJeffery A. KuehnSteve PoolePublished in: ICPP (2009)
Keyphrases
- distributed memory
- parallel implementation
- ibm sp
- shared memory
- parallel computers
- parallel architecture
- multiprocessor systems
- fine grain
- scientific computing
- parallel algorithm
- matrix multiplication
- parallel computing
- parallel computation
- multi processor
- parallel processing
- data parallelism
- efficient implementation
- massively parallel
- computer architecture
- message passing