Login / Signup
Sparse is Enough in Scaling Transformers.
Sebastian Jaszczur
Aakanksha Chowdhery
Afroz Mohiuddin
Lukasz Kaiser
Wojciech Gajewski
Henryk Michalewski
Jonni Kanerva
Published in:
CoRR (2021)
Keyphrases
</>
sparse representation
high dimensional
sparse data
compressive sensing
dictionary learning
neural network
machine learning
high dimension
data sets
knowledge base
training data
dimensionality reduction