Multi-Stage Memory Efficient Strassen's Matrix Multiplication on GPU.
Arjun Gopala KrishnanDhrubajyoti GoswamiPublished in: HiPC (2021)
Keyphrases
- memory efficient
- multistage
- matrix multiplication
- message passing
- distributed memory
- parallel implementation
- single stage
- stochastic programming
- production system
- dynamic programming
- external memory
- lot sizing
- matrix factorization
- parallel computing
- iterative deepening
- stochastic optimization
- graphics processing units
- attack detection
- shared memory
- belief propagation
- optimal policy
- parallel processing
- state space
- stereo matching
- objective function