Optimizing Matrix Operations on a Parallel Multiprocessor with a Memory Hierarchical System.
William JalbyUlrike MeierPublished in: ICPP (1986)
Keyphrases
- distributed memory
- matrix multiplication
- processing elements
- level parallelism
- multiprocessor systems
- ibm sp
- database machines
- single processor
- shared memory
- parallel hardware
- multithreading
- parallel computers
- parallel implementation
- parallel processors
- parallel processing
- multi threaded
- massively parallel
- distributed shared memory
- highly parallel
- shared memory multiprocessor
- processing units
- associative memory
- parallel computing
- neural network
- random access
- memory requirements
- rows and columns
- computational power
- memory footprint
- singular value decomposition
- coarse to fine
- low rank
- memory usage
- computer architecture
- computing power
- parallel architectures
- limited memory
- matrix representation
- compute intensive
- memory bandwidth
- parallel execution
- storage media
- response time
- memory space