TDC: Towards Extremely Efficient CNNs on GPUs via Hardware-Aware Tucker Decomposition.
Lizhi Xiang, Miao Yin, Chengming Zhang, Aravind Sukumaran-Rajam, P. Sadayappan, Bo Yuan, Dingwen Tao
Published in: CoRR (2022)
Keyphrases
- real time
- parallel architectures
- low cost
- cost effective
- graphics processors
- hardware and software
- computer systems
- graphics hardware
- database
- computing systems
- efficient implementation
- parallel execution
- data sets
- graphics processing units
- computational power
- image processing
- computationally expensive
- computationally efficient
- neural network
- multiresolution
- multiscale