Co-exploration of NLA kernels and specification of Compute Elements in distributed memory CGRAs.
Mahesh Mahadurkar, Farhad Merchant, Arka Maity, Kapil Vatwani, Ishan Munje, Nandhini Gopalan, S. K. Nandy, Ranjani Narayan
Published in: ICSAMOS (2014)
Keyphrases
- distributed memory
- shared memory
- coarse grained
- multiprocessor systems
- level parallelism
- IBM SP
- parallel implementation
- parallel architecture
- data parallelism
- parallel computers
- multithreading
- high level
- message passing
- matrix multiplication
- parallel machines
- parallel processing
- parallel algorithm
- data partitioning
- graph cuts
- artificial intelligence