Login / Signup
DropCompute: simple and more robust distributed synchronous training via compute variance reduction.
Niv Giladi
Shahar Gottlieb
Moran Shkolnik
Asaf Karnieli
Ron Banner
Elad Hoffer
Kfir Yehuda Levy
Daniel Soudry
Published in:
CoRR (2023)
Keyphrases
</>
variance reduction
gradient estimation
distributed systems
training process
machine learning
training data
monte carlo
bias variance decomposition
image sequences
video sequences
markov chain
training samples
dynamic systems