Maximizing Communication Efficiency for Large-scale Training via 0/1 Adam.
Yucheng LuConglong LiMinjia ZhangChristopher De SaYuxiong HePublished in: ICLR (2023)
Keyphrases
- small scale
- high scalability
- real world
- training process
- computational efficiency
- test set
- share information
- training samples
- supervised learning
- real life
- training set
- training data
- small number
- data sets
- information sharing
- communication networks
- highly efficient
- training phase
- face recognition
- data mining
- hearing impaired