Analyzing OpenCL 2.0 workloads using a heterogeneous CPU-GPU simulator.
Li WangRen-Wei TsaiShao-Chung WangKun-Chih ChenPo-Han WangHsiang-Yun ChengYi-Chung LeeSheng-Jie ShuChun-Chieh YangMin-Yih HsuLi-Chen KanChao-Lin LeeTzu-Chieh YuRih-Ding PengChia-Lin YangYuan-Shin HwangJenq Kuen LeeShiao-Li TsaoMing OuhyoungPublished in: ISPASS (2017)
Keyphrases
- graphics processing units
- general purpose
- gpu implementation
- heterogeneous computing
- graphics processors
- real time
- computing systems
- graphics hardware
- parallel processing
- floating point
- parallel programming
- parallel computation
- parallel implementation
- parallel computing
- database systems
- efficient implementation
- compute unified device architecture
- high performance computing
- computer systems
- real world
- simulation model
- processing units
- information systems
- heterogeneous networks
- parallel machines
- cpu implementation
- memory bandwidth
- massively parallel
- compute intensive
- scientific computing
- parallel architectures
- access patterns
- neural network