Flexible and Efficient Convolutional Acceleration on Unified Hardware Using the Two-Stage Splitting Method and Layer-Adaptive Allocation of 1-D/2-D Winograd Units.
Chen Yang, Yaoyao Yang, Yishuo Meng, Kaibo Huo, Siwei Xiang, Jianfei Wang, Li Geng
Published in: IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. (2024)