Joint structured pruning and dense knowledge distillation for efficient transformer model compression.
Baiyun Cui
Yingming Li
Zhongfei Zhang
Published in:
Neurocomputing (2021)
Keyphrases
conceptual model
expert knowledge
real world
computational model
statistical model
prior knowledge
knowledge representation
mathematical model
neural network
knowledge base
high level
knowledge discovery
management system
parameter estimation
structured data
formal model