Login / Signup
Investigating Low-Rank Training in Transformer Language Models: Efficiency and Scaling Analysis.
Xiuying Wei
Skander Moalla
Razvan Pascanu
Caglar Gulcehre
Published in:
CoRR (2024)
Keyphrases
</>
feature selection
language model
low rank
language modeling
convex optimization
machine learning
probabilistic model
information retrieval
neural network
n gram
query expansion
matrix factorization
missing data
data analysis
matrix completion
linear combination
rank minimization
high dimensional data
data mining