Bridging OpenCL and CUDA: a comparative analysis and translation.
Junghyun Kim, Thanh Tuan Dao, Jaehoon Jung, Jinyoung Joo, Jaejin Lee
Published in: SC (2015)
Keyphrases
- parallel programming
- shared memory
- compute unified device architecture
- general purpose
- machine translation
- graphics processing units
- parallel computing
- parallel algorithm
- gpu implementation
- cross language information retrieval
- parallel implementation
- gpu accelerated
- neural network
- statistical machine translation
- machine translation system
- times faster
- social networks
- data mining