Exploring OpenMP GPU Offloading for Implementing Convolutional Neural Networks.
Kewei YanYaying ShiYonghong YanPublished in: PMAM@PPoPP (2023)
Keyphrases
- convolutional neural networks
- graphics processing units
- parallel programming
- convolutional network
- parallel computing
- shared memory
- real time
- efficient implementation
- parallel processing
- general purpose
- parallel implementation
- high performance computing
- gpu implementation
- multi core processors
- database
- graphics processors
- information systems
- parallel execution
- gpu accelerated
- parallel algorithm
- parallel architectures
- parallel computation
- cluster of workstations