CUDA Kernel Based Collective Reduction Operations on Large-scale GPU Clusters.
Ching-Hsiang ChuKhaled HamidoucheAkshay VenkateshAmmar Ahmad AwanDhabaleswar K. PandaPublished in: CCGrid (2016)
Keyphrases
- parallel implementation
- gpu accelerated
- graphics processors
- gpu implementation
- graphics hardware
- parallel computing
- kernel based clustering
- clustering algorithm
- parallel computation
- compute unified device architecture
- general purpose
- real time
- small scale
- real life
- collective behavior
- hierarchical clustering
- parallel algorithm
- graphics processing units
- collective intelligence
- kernel methods
- data clustering
- cluster analysis
- parallel programming
- support vector machine
- data points
- real world
- shared memory
- processing units
- document clustering
- fuzzy clustering
- overlapping clusters
- data structure
- neural network