Login / Signup
Runtime Programmable and Memory Bandwidth Optimized FPGA-Based Coprocessor for Deep Convolutional Neural Network.
Nimish Shah
Paragkumar Chaudhari
Kuruvilla Varghese
Published in:
IEEE Trans. Neural Networks Learn. Syst. (2018)
Keyphrases
</>
convolutional neural network
memory bandwidth
face detection
level parallelism
instruction set
low cost
application specific
general purpose
floating point
neural network
massively parallel
hardware implementation
processing units
object detection
parallel programming