Login / Signup
KroneckerBERT: Learning Kronecker Decomposition for Pre-trained Language Models via Knowledge Distillation.
Marzieh S. Tahaei
Ella Charlaix
Vahid Partovi Nia
Ali Ghodsi
Mehdi Rezagholizadeh
Published in:
CoRR (2021)
Keyphrases
</>
language model
learning algorithm
learning process
prior knowledge
language modeling
information retrieval
probabilistic model
language modelling
context sensitive
n gram
statistical language models
speech recognition
knn
active learning
query expansion
unsupervised learning
supervised learning
relevance model