On the Cost of a General GPU Framework: The Strange Case of CUDA 4.0 vs. CUDA 5.0.
Matthew Wezowicz, Michela Taufer
Published in: SC Companion (2012)
Keyphrases
- parallel implementation
- GPU implementation
- general purpose
- parallel computing
- GPU accelerated
- real time
- main contribution
- graphics processors
- graphics hardware
- theoretical framework
- machine learning
- conceptual framework
- closely related
- special case
- parallel computation
- general theory
- shared memory
- compute unified device architecture