A GEMM interface and implementation on NVIDIA GPUs for multiple small matrices.
Chetan Jhurani, Paul Mullowney
Published in: J. Parallel Distributed Comput. (2015)
Keyphrases
- graphics processing units
- graphics processors
- graphics cards
- general purpose
- efficient implementation
- database systems
- user friendly
- parallel processing
- database
- small sized
- user interface
- parallel implementation
- compute unified device architecture
- gpu implementation
- web services
- website
- information systems
- genetic algorithm
- real time