Adaptive Gradient Quantization for Data-Parallel SGD.
Fartash FaghriIman TabrizianIlia MarkovDan AlistarhDaniel M. RoyAli Ramezani-KebryaPublished in: CoRR (2020)
Keyphrases
- data analysis
- data sets
- raw data
- high dimensional data
- database
- data collection
- statistical analysis
- data structure
- machine learning
- data distribution
- prior knowledge
- databases
- input data
- data processing
- sensor data
- computer systems
- parallel processing
- data mining techniques
- knowledge discovery
- data points
- probability distribution
- recommender systems
- training data
- social networks