Implementation of CG Method on GPU Cluster with Proprietary Interconnect TCA for GPU Direct Communication.
Kazuya MatsumotoToshihiro HanawaYuetsu KodamaHisafumi FujiiTaisuke BokuPublished in: IPDPS Workshops (2015)
Keyphrases
- high accuracy
- high precision
- computational complexity
- real time
- gpu accelerated
- computational cost
- clustering algorithm
- pairwise
- cost function
- parallel implementation
- detection method
- dynamic programming
- preprocessing
- image processing
- gpu implementation
- parallel computation
- graphics processing units
- cluster of workstations
- parallel algorithm
- segmentation method
- clustering method
- open source
- significant improvement