O⁴-DNN: A Hybrid DSP-LUT-Based Processing Unit With Operation Packing and Out-of-Order Execution for Efficient Realization of Convolutional Neural Networks on FPGA Devices.
Pouya HaghiMehdi KamalAli Afzali-KushaMassoud PedramPublished in: IEEE Trans. Circuits Syst. I Regul. Pap. (2020)