Debugging CUDA Accelerated Parallel Applications with TotalView.
Chris GottbrathRoyd LüdtkePublished in: Parallel Tools Workshop (2011)
Keyphrases
- parallel programming
- parallel implementation
- parallel algorithm
- parallel computing
- parallel computation
- parallel processing
- shared memory
- multi core processors
- massively parallel
- compute unified device architecture
- parallel hardware
- processing units
- cloud computing
- case study
- times faster
- distributed memory
- message passing interface
- database
- computer architecture
- general purpose
- depth first search
- learning algorithm
- machine learning
- fault localization
- data mining