GPU-Aware MPI on RDMA-Enabled Clusters: Design, Implementation and Evaluation.
Hao WangSreeram PotluriDevendar BureddyCarlos RosalesDhabaleswar K. PandaPublished in: IEEE Trans. Parallel Distributed Syst. (2014)
Keyphrases
- parallel implementation
- circuit design
- efficient implementation
- real time
- formative evaluation
- implementation issues
- parallel distributed
- design process
- cluster analysis
- design methodology
- parallel computing
- graphics processing units
- rapid prototyping
- evaluation model
- graphics cards
- architectural design
- parallel algorithm
- general purpose
- case study
- clustering algorithm