Login / Signup

A Task-Efficient Gradient Guide Knowledge Distillation for Pre-train Language Model Compression.

Xu LiuYila SuNier Wu
Published in: ICIC (LNAI 3) (2024)
Keyphrases