On the GPU-CPU Performance Portability of OpenCL for 3D Stencil Computations.
Huayou Su, Nan Wu, Mei Wen, Chunyuan Zhang, Xing Cai
Published in: ICPADS (2013)
Keyphrases
- graphics processing units
- parallel computation
- general purpose
- GPU implementation
- parallel processing
- graphics hardware
- parallel computing
- parallel implementation
- computing systems
- graphics processors
- efficient implementation
- compute unified device architecture
- floating point
- massively parallel
- real time
- parallel programming
- high performance computing
- CPU implementation
- processing units
- database systems
- parallel machines
- parallel architectures
- commodity hardware
- information systems
- neural network
- parallel algorithm
- memory management
- programming language