Login / Signup
Further Boosting BERT-based Models by Duplicating Existing Layers: Some Intriguing Phenomena inside BERT.
Wei-Tsung Kao
Tsung-Han Wu
Po-Han Chi
Chun-Cheng Hsieh
Hung-yi Lee
Published in:
CoRR (2020)
Keyphrases
</>
statistical models
diffusion models
data sets
machine learning
artificial intelligence
search engine
computer vision
parameter estimation
complex systems
computational models
neural network
probabilistic model
statistical model