Use of parallel level 3 BLAS in LU factorization on three vector multiprocessors the ALLIANT FX/80, the CRAY-2, and the IBM 3090 VF.
Michel J. DaydéIain S. DuffPublished in: ICS (1990)
Keyphrases
- distributed memory
- shared memory
- scientific computing
- parallel implementation
- parallel computers
- data parallelism
- parallel processing
- vector space
- pairwise
- singular value decomposition
- higher level
- parallel computing
- levels of abstraction
- coarse grained
- massively parallel
- feature vectors
- highly parallel
- neural network
- real time