Scale Efficiently: Insights from Pretraining and Finetuning Transformers.
Yi Tay
Mostafa Dehghani
Jinfeng Rao
William Fedus
Samira Abnar
Hyung Won Chung
Sharan Narang
Dani Yogatama
Ashish Vaswani
Donald Metzler
Published in:
ICLR (2022)
Keyphrases
scale space
database
similarity measure
bayesian networks
highly efficient
data sets
neural network
real world
data mining
image processing
multimedia
database systems
wide range
trade off
prune the search space
theoretical insights