Login / Signup
No Train No Gain: Revisiting Efficient Training Algorithms For Transformer-based Language Models.
Jean Kaddour
Oscar Key
Piotr Nawrot
Pasquale Minervini
Matt J. Kusner
Published in:
NeurIPS (2023)
Keyphrases
</>
language model
document retrieval
language modeling
smoothing methods
probabilistic model
n gram
test collection
context sensitive