Evaluating performance of Parallel Matrix Multiplication Routine on Intel KNL and Xeon Scalable Processors.
Thi My Tuyen NguyenYoosang ParkJaeyoung ChoiRaehyun KimPublished in: ACSOS Companion (2020)
Keyphrases
- distributed memory
- matrix multiplication
- shared memory
- multi core processors
- multi processor
- single processor
- multiprocessor systems
- data parallelism
- parallel implementation
- message passing
- parallel computers
- parallel processing
- parallel architecture
- parallel programming
- computer architecture
- multithreading
- parallel computation
- parallel computing
- parallel algorithm
- single instruction multiple data
- parallel machines
- parallel execution
- parallel tree search
- commodity hardware
- processing elements
- parallel architectures
- parallel processors
- high end
- processing units
- highly efficient
- processor array
- similarity measure
- three dimensional