LoCo: Low-Bit Communication Adaptor for Large-scale Model Training.
Xingyu XieZhijie LinKim-Chuan TohPan ZhouPublished in: CoRR (2024)
Keyphrases
- mathematical model
- high level
- probabilistic model
- experimental data
- theoretical framework
- search engine
- decision making
- small scale
- similarity measure
- multiscale
- prediction model
- theoretical analysis
- restricted boltzmann machine
- machine learning
- simulation model
- em algorithm
- supervised learning
- probability distribution
- cost function
- knowledge base
- feature selection