Implementation of 3D FFTs Across Multiple GPUs in Shared Memory Environments.
Nimalan NandapalanJirí JarosAlistair P. RendellBradley E. TreebyPublished in: PDCAT (2012)
Keyphrases
- shared memory
- parallel programming
- low overhead
- parallel architectures
- parallel algorithm
- parallel architecture
- message passing
- shared memory multiprocessors
- shared memory multiprocessor
- parallel computing
- parallel computers
- distributed memory
- commodity hardware
- multi core systems
- interprocess communication
- multi processor
- graphics processing units
- graphic processing unit
- parallel computation
- parallel processing
- general purpose
- address space
- higher order
- parallel execution
- scheduling problem
- memory access
- multithreading
- hardware implementation
- operating system
- efficient implementation
- massively parallel
- computational power