Automatic Parallelization of Kernels in Shared-Memory Multi-GPU Nodes.
Javier CabezasLluís VilanovaIsaac GeladoThomas B. JablinNacho NavarroWen-mei W. HwuPublished in: ICS (2015)
Keyphrases
- shared memory
- parallel computing
- parallel programming
- parallel computation
- parallel algorithm
- distributed memory
- graphic processing unit
- message passing
- parallel architectures
- compute unified device architecture
- multi processor
- coarse grained
- shared memory multiprocessor
- parallel execution
- parallel machines
- parallel architecture
- processing units
- address space
- commodity hardware
- parallel computers
- multi core systems
- multithreading
- parallel implementation
- parallel processing
- message passing interface
- computer architecture
- image processing
- pairwise
- real time
- wireless sensor networks
- high performance computing
- interprocess communication
- belief propagation
- heterogeneous platforms