Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity.
William Fedus
Barret Zoph
Noam Shazeer
Published in:
CoRR (2021)
Keyphrases
statistical models
data sets
databases
high dimensional
probabilistic model
cost effective
computationally expensive
parameter settings
artificial neural networks
process model