On the performance of various parallel GMRES implementations on CPU and GPU clusters.
Efstathios I. IoannidisNikolaos CheimariosAntony N. SpyropoulosAndreas G. BoudouvisPublished in: CoRR (2019)
Keyphrases
- graphics processing units
- graphics processors
- parallel implementation
- general purpose
- parallel processing
- parallel implementations
- cpu implementation
- gpu implementation
- parallel computation
- efficient implementation
- parallel programming
- parallel computing
- pc cluster
- memory bandwidth
- graphics hardware
- multi threaded
- compute unified device architecture
- massively parallel
- real time
- clustering algorithm
- parallel algorithm
- multithreading
- computing systems
- hierarchical clustering
- data clustering
- cluster analysis
- level parallelism
- floating point
- data transfer
- shared memory
- data points
- high performance computing
- single instruction multiple data
- heterogeneous computing
- overlapping clusters
- scientific computing
- self organizing maps
- neural network