Improving Performance of Triangular Matrix-Vector BLAS Routines on GPUs.
Marek Karwacki, Przemyslaw Stpiczynski
Published in: PARCO (2011)
Keyphrases
- linear algebra
- sparse matrix
- rows and columns
- eigenvalues and eigenvectors
- matrix representation
- singular value decomposition
- symmetric matrix
- highly optimized
- linearly independent
- transformation matrix
- general purpose
- image processing
- parallel processing
- dot product
- computational power
- similarity matrix
- weight matrix
- feature vectors
- neural network
- low rank
- matrix factorization
- efficient implementation
- coefficient matrix
- data sets