Efficient Shared-Memory Implementation of High-Performance Conjugate Gradient Benchmark and its Application to Unstructured Matrices.
Jongsoo Park, Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, Alexander Heinecke, Dhiraj D. Kalamkar, Xing Liu, Md. Mostofa Ali Patwary, Yutong Lu, Pradeep Dubey
Published in: SC (2014)
Keyphrases
- shared memory
- low overhead
- conjugate gradient
- parallel computers
- distributed memory
- parallel architectures
- message passing
- shared memory multiprocessors
- parallel computing
- efficient implementation
- parallel algorithm
- training algorithm
- shared memory multiprocessor
- parallel processing
- learning algorithm
- singular value decomposition
- belief propagation
- fuzzy logic
- pairwise
- feature space