Sign in

Token-Scaled Logit Distillation for Ternary Weight Generative Language Models.

Minsoo KimSihwa LeeJanghwan LeeSukjin HongDu-Seong ChangWonyong SungJungwook Choi
Published in: CoRR (2023)
Keyphrases