Micro-architectural Enhancements in Distributed Memory CGRAs for LU and QR Factorizations.
Farhad MerchantArka MaityMahesh MahadurkarKapil VatwaniIshan MunjeMadhava Krishna CNalesh SivanandanNandhini GopalanSoumyendu RahaS. K. NandyRanjani NarayanPublished in: VLSI Design (2015)
Keyphrases
- distributed memory
- coarse grained
- shared memory
- level parallelism
- ibm sp
- parallel implementation
- fine grained
- multiprocessor systems
- parallel computers
- software architecture
- singular value decomposition
- parallel architecture
- data partitioning
- data parallelism
- parallel algorithm
- multithreading
- parallel processing
- parallel machines
- matrix multiplication
- message passing
- parallel programming
- parallel computing
- parallel computation
- artificial intelligence
- matrix factorization