NAS Parallel Benchmarks with CUDA and beyond.
Gabriell Alves de AraujoDalvan GrieblerDinei A. RockenbachMarco DaneluttoLuiz Gustavo FernandesPublished in: Softw. Pract. Exp. (2023)
Keyphrases
- shared memory
- parallel implementation
- parallel computing
- parallel programming
- parallel computation
- compute unified device architecture
- distributed memory
- parallel algorithm
- parallel processing
- message passing
- times faster
- general purpose
- machine learning
- distributed processing
- image segmentation
- computer architecture
- learning algorithm
- parallel architectures
- parallel execution
- data sets