CNNBooster: Accelerating CNN Inference with Latency-aware Channel Pruning for GPU.
Yuting Zhu, Hongxu Jiang, Runhua Zhang, Yonghua Zhang, Dong Dong
Published in: ISPA/BDCloud/SocialCom/SustainCom (2022)
Keyphrases
- heterogeneous computing
- real time
- search space
- cellular neural networks
- multi channel
- parallel processing
- probabilistic inference
- pruning method
- data transfer
- wireless broadcast
- pruning algorithms
- gpu implementation
- channel coding
- neural network
- graphics hardware
- pruning algorithm
- communication channels
- ofdm system
- error correction
- bayesian inference
- bayesian networks