Memory Optimized Dynamic Matrix Chain Multiplication Using Shared Memory in GPU.
Girish BiswasNandini MukherjeePublished in: ICDCIT (2021)
Keyphrases
- shared memory
- parallel computing
- parallel computation
- parallel programming
- multithreading
- message passing
- distributed memory
- memory access
- parallel algorithm
- parallel architectures
- graphic processing unit
- shared memory multiprocessors
- parallel architecture
- commodity hardware
- parallel processing
- address space
- multi processor
- linear algebra
- compute unified device architecture
- interprocess communication
- parallel computers
- parallel machines
- computational power
- parallel execution
- real time
- heterogeneous platforms
- parallel implementation
- computer vision