Login / Signup

Joint structured pruning and dense knowledge distillation for efficient transformer model compression.

Baiyun CuiYingming LiZhongfei Zhang
Published in: Neurocomputing (2021)
Keyphrases