Login / Signup
MS-BERT: A Multi-layer Self-distillation Approach for BERT Compression Based on Earth Mover's Distance.
Jiahui Huang
Bin Cao
Jiaxing Wang
Jing Fan
Published in:
CollaborateCom (2) (2021)
Keyphrases
</>
data mining
multi layer
feed forward neural networks
neural network
error back propagation
neural nets
single layer
image compression
information technology
multiple layers
group sparse coding