Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training.
Yuanxin LiuFandong MengZheng LinPeng FuYanan CaoWeiping WangJie ZhouPublished in: NAACL-HLT (2022)
Keyphrases
- learning algorithm
- supervised learning
- online learning
- recurrent networks
- learning speed
- learning transfer
- training set
- learning process
- motor skills
- learning machines
- learning phase
- learning problems
- online training
- learning stage
- stochastic gradient descent
- radial basis function network
- data sets
- learning systems
- knowledge acquisition
- prior knowledge
- neural network