Curriculum Learning: A Regularization Method for Efficient and Stable Billion-Scale GPT Model Pre-Training.
Conglong Li
Minjia Zhang
Yuxiong He
Published in:
CoRR (2021)
Keyphrases
prior knowledge
learning process
supervised learning
machine learning
image processing
image sequences
bayesian networks
multiscale
objective function
probabilistic model
maximum likelihood
particle swarm optimization
high order