Login / Signup

GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model.

Shicheng TanWeng Lam TamYuanchun WangWenwen GongShu ZhaoPeng ZhangJie Tang
Published in: ACL (industry) (2023)
Keyphrases