Length-Adaptive Distillation: Customizing Small Language Model for Dynamic Token Pruning.

Published in: EMNLP (Findings) (2023)

Keyphrases