Exploring OpenMP GPU Offloading for Implementing Convolutional Neural Networks.

Kewei Yan Yaying Shi Yonghong Yan

Published in: PMAM@PPoPP (2023)

Keyphrases

convolutional neural networks
graphics processing units
parallel programming
convolutional network
parallel computing
shared memory
real time
efficient implementation
parallel processing
general purpose
parallel implementation
high performance computing
gpu implementation
multi core processors
database
graphics processors
information systems
parallel execution
gpu accelerated
parallel algorithm
parallel architectures
parallel computation
cluster of workstations