High Performance GPU Code Generation for Matrix-Matrix Multiplication using MLIR: Some Early Results.
Navdeep Katel, Vivek Khandelwal, Uday Bondhugula
Published in: CoRR (2021)
Keyphrases
- matrix multiplication
- code generation
- distributed memory
- multilingual information retrieval
- parallel implementation
- application development
- graphics processing units
- code generator
- software development
- rapid prototyping
- message passing
- modeling language
- formal specification
- model driven
- software reuse
- shared memory
- matrix factorization
- data driven
- design patterns
- design tools
- parallel algorithm
- higher order
- image segmentation
- databases