Fast implementation of DGEMM on Fermi GPU.
Guangming TanLinchuan LiSean TriechleEverett H. PhillipsYungang BaoNinghui SunPublished in: SC (2011)
Keyphrases
- real time
- parallel implementation
- cluster of workstations
- graphics cards
- graphics processing units
- general purpose
- databases
- implementation details
- decision making
- multiscale
- parallel processing
- efficient implementation
- hardware implementation
- learning algorithm
- real world
- implementation issues
- parallel computation
- database