CUDA M3: Designing Efficient CUDA Managed Memory-Aware MPI by Exploiting GDR and IPC.
Khaled HamidoucheAmmar Ahmad AwanAkshay VenkateshDhabaleswar K. PandaPublished in: HiPC (2016)
Keyphrases
- parallel implementation
- general purpose
- parallel computing
- graphics processors
- parallel algorithm
- shared memory
- message passing interface
- gpu implementation
- parallel programming
- parallel computation
- massively parallel
- high efficiency
- computationally expensive
- lightweight
- genetic algorithm
- cost effective
- operating system
- learning algorithm
- memory space
- distributed memory
- memory efficient
- limited memory
- data sets
- real time
- database