Login / Signup
AdaDS: Adaptive data selection for accelerating pre-trained language model knowledge distillation.
Qinhong Zhou
Peng Li
Yang Liu
Yuyang Guan
Qizhou Xing
Ming Chen
Maosong Sun
Yang Liu
Published in:
AI Open (2023)
Keyphrases
</>
language model
data sets
prior knowledge
knowledge discovery
training data
machine learning
information retrieval
feature space
probabilistic model
feature selection
active learning
training samples
n gram
pre trained