Hybrid CUDA, OpenMP, and MPI parallel programming on multicore GPU clusters.
Chao-Tung YangChih-Lin HuangCheng-Fang LinPublished in: Comput. Phys. Commun. (2011)
Keyphrases
- parallel programming
- shared memory
- parallel computing
- parallel algorithm
- parallel computation
- parallel processing
- multi core processors
- graphics processing units
- message passing interface
- highly parallel
- cloud computing
- programming environment
- massively parallel
- compute unified device architecture
- high end
- processing units
- message passing
- parallel implementation
- multicore processors
- parallel execution
- distributed memory
- general purpose
- parallel architectures
- parallel computers
- high performance computing
- image segmentation
- gpu implementation
- memory bandwidth
- database systems