AdaComp : Adaptive Residual Gradient Compression for Data-Parallel Distributed Training.
Chia-Yu Chen
Jungwook Choi
Daniel Brand
Ankur Agrawal
Wei Zhang
Kailash Gopalakrishnan
Published in:
CoRR (2017)
Keyphrases
data collection
data sets
high quality
synthetic data
training data
training dataset
database
small number
knowledge discovery
data analysis
data structure
probability distribution
input data
training examples
missing data
data reduction
genetic algorithm