Code Generation and Optimization of Distributed-Memory Dense Linear Algebra Kernels.
Bryan MarkerDon S. BatoryRobert A. van de GeijnPublished in: ICCS (2013)
Keyphrases
- linear algebra
- distributed memory
- code generation
- parallel computers
- shared memory
- computer architecture
- application development
- parallel implementation
- image processing
- singular value decomposition
- software development
- modeling language
- rapid prototyping
- software reuse
- radon transform
- parallel machines
- model driven
- parallel processing
- data processing
- image segmentation