Exploration of OpenCL 2D Convolution Kernels on Intel FPGA, CPU, and GPU Platforms.
Zheming JinHal FinkelPublished in: IEEE BigData (2019)
Keyphrases
- graphics processing units
- field programmable gate array
- general purpose
- parallel computing
- real time
- gpu implementation
- massively parallel
- parallel processing
- graphics hardware
- parallel computation
- parallel programming
- compute unified device architecture
- computer architecture
- computing systems
- parallel implementation
- graphics processors
- hardware implementation
- efficient implementation
- shared memory
- floating point
- computing platform
- multi core processors
- hardware design
- high speed
- multi threaded
- low cost
- high performance computing
- parallel architectures
- graphic processing unit
- multithreading
- parallel machines
- heterogeneous computing
- parallel architecture
- signal processing
- single instruction multiple data
- level parallelism
- memory bandwidth
- cpu implementation
- processing units
- verilog hdl
- hardware architectures
- parallel hardware
- fpga device
- commodity hardware
- data analytics
- software implementation
- parallel algorithm