High-performance lattice QCD for multi-core based parallel systems using a cache-friendly hybrid threaded-MPI approach.
Mikhail SmelyanskiyKarthikeyan VaidyanathanJee W. ChoiBálint JoóJatin ChhuganiMichael A. ClarkPradeep DubeyPublished in: SC (2011)
Keyphrases
- multithreading
- parallel implementation
- computer systems
- shared memory
- distributed memory
- multi threaded
- computing systems
- parallel computing
- data intensive
- massively parallel
- data access
- complex systems
- data management
- prefetching
- parallel programming
- high end
- parallel computers
- web caching
- distributed memory machines