Frequency Domain Acceleration of Convolutional Neural Networks on CPU-FPGA Shared Memory System.
Chi ZhangViktor K. PrasannaPublished in: FPGA (2017)
Keyphrases
- frequency domain
- shared memory
- convolutional neural networks
- parallel architecture
- memory access
- parallel computing
- field programmable gate array
- multithreading
- compute unified device architecture
- message passing
- parallel algorithm
- spatial domain
- fourier transform
- distributed memory
- feature extraction
- graphics processing units
- graphic processing unit
- cross correlation
- hardware implementation
- parallel computers
- parallel programming
- hardware design
- denoising
- parallel architectures
- parallel machines
- fast fourier transform
- massively parallel
- three dimensional
- subband
- image compression
- signal processing
- distributed systems
- graphical models
- probabilistic model
- multiresolution
- shared memory multiprocessors